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Abstract 

When estimating high-frequency covariance (quadratic covariation) of two ar¬ 
bitrary assets observed asynchronously, simple assumptions, such as indepen¬ 
dence, are usually imposed on the relationship between the prices process and 
the observation times. In this paper, we introduce a general endogenous two- 
dimensional nonparametric model. Because an observation is generated whenever 
an auxiliary process called observation time proeess hits one of the two boundary 
processes, it is called the hitting boundary proeess with time proeess (HBT) model. 

We establish a central limit theorem for the Hayashi-Yoshida (HY) estimator un¬ 
der HBT in the case where the price process and the observation price process 
follow a continuous Ito process. We obtain an asymptotic bias. We provide an es¬ 
timator of the latter as well as a bias-corrected HY estimator of the high-frequency 
covariance. In addition, we give a consistent estimator of the associated standard 
error. 
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1 Introduction 


Covariation between two assets is a crucial quantity in finance. Fundamental examples 
include optimal asset allocation and risk management. In the past few years, using the 
increasing amount of high-frequency data available, many papers have been published 
about estimating this covariance. Suppose that the latent log-price of two arbitrary 
assets Xt = (xj^\ follows a continuous Ito process 

dX^^ := + (1) 

:= /if + af \ (2) 

where are random processes, and and are standard Brow¬ 

nian motions, with (random) high-frequency correlation d{W^^\W^‘^'^)t = ptdt. Econo¬ 
metrics usually seeks to infer the integrated eovariation 

Jo 

Earlier results were focused on estimating the integrated variance of a single asset, 
starting from the probabilistic point of view (Genon-Catalot and Jacod (1993), Ja- 
cod (1994)). Barndorff-Nielsen and Shephard (2001, 2002) introduced the problem in 
econometrics. Adapted to two dimensions, if each process is observed simultaneously at 
(possibly random) times ro,n := 0, , • • •, ^Nr^n the realized eovariation \X‘d\X^‘^'>]^^ 

is dehned as the sum of cross log returns 

5: A.YWA.Y®, (3) 

where for any positive integer i, Axf ^ — Xrf corresponds to the increment 

of the kth process between the last two sampling times. As the observation intervals 
Avi^n get closer (and the number of observations Nn goes to inhnity), ^ 
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(see e.g. Theorem 1.4.47 in Jacod and Shiryaev (2003)). Furthermore, 
when the observation times Ti^n are independent of the prices process Xt, its estimation 
error follows a mixed normal distribution (Jacod and Protter (1998), Zhang (2001), 
Mykland and Zhang (2006)). This gives us insight on how to estimate the integrated 
covariation. However, in practice, these two assumptions are usually not satished. The 
observation times of the two assets are rarely synchronous and there is endogeneity in 
the price sampling times. 


The hrst issue has been studied for a long time. The lack of synchronicity often 
creates undesirable effects in inference. If we sample at very high frequencies, we 
observe the Epps effect (Epps (1979)), i.e. the correlation estimates are drastically 
decreased compared to an estimate with sparse observations. Hayashi and Yoshida 
(2005) introduced the so-called Hayashi-Yoshida estimator (HY) 


{xYx(^))r = Y 


A ■JVPl A 1 , 

(i)ZiA. (2)lr (1) (1). .(2) pa).,.-!, 


'( 2 ) 


i,n ’ 


.( 2 ) 


( 4 ) 


<t 


where are the observation times of the kth asset. Note that if the observations of 
both processes occur simultaneously, ([^ and (|^ are equal. The consistency of this es¬ 
timator was achieved in Hayashi and Yoshida (2005) and Hayashi and Kusuoka (2008). 
The corresponding central limit theorems were investigated in Hayashi and Yoshida 
(2008, 2011) under strong predictability of observation times, which is a more restrictive 
assumption than only assuming they are stopping times but still allows some depen¬ 
dence between prices and observation times. Recently, Koike (2014, 2015) extended the 
pre-averaged Hayashi-Yoshida estimator hrst under predictability of observation times, 
and then under a more general endogenous setting of stopping times. Other examples of 
high-frequency covariance estimators can be found in Zhang (2011), Barndorff-Nielsen 
et ah (2011), Ai't-Sahalia et ah (2010), Christensen et ah (2010, 2013). 


In a general one-dimensional endogenous model, the asymptotic behaviour of the 
realized volatility ([^ has been investigated in the case of sampling times given by hitting 
times on a grid (Fukasawa (2010a), Robert and Rosenbaum (2011, 2012), Fukasawa and 
Rosenbaum (2012)). Due to the regularity of those three models (see the discussion in 
the latter paper), they don’t obtain any bias in the limit distribution of the normalized 
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error. Also, the case of strongly predictable stopping times is treated in Hayashi et al. 
(2011). Finally, two general results (Fukasawa (2010b), Li and al. (2014)) showed that 
we can identify and estimate the asymptotic bias. 

The primary goal of this paper is to bias-correct the HY. Note that estimating 
the bias is more challenging than in the volatility case because observations are asyn¬ 
chronous. In particular, the estimator will involve a quantity that can be considered 
as the tricity of Li et al. (2014), but with a more intricate definition because of the 
asynchronicity in sampling times. This new definition can be seen as an analogy with 
the generalization of the RV estimator (|^ by the HY estimator (|^. 

Another very important issue to address is the estimation of the asymptotic standard 
deviation. First, because the model is more general than in the no-endogeneity work, 
the theoretical asymptotic variance will be different. Consequently, a new variance 
estimator, which takes into proper account the endogeneity, will be given. 

The authors want to take no position on the joint distribution of the log-return 
and the next observation time that corresponds to an asset price change because they 
know that their unknown relationship is most likely contributing to the bias and the 
variance of the high-frequency covariance’s estimate when we (wrongly) assume full 
independence between the price process and observation times. For this purpose, they 
introduce the hitting boundary process with time process (HBT) model. 

Finally, techniques developed in the proofs are innovative in the sense that they 
reduce the normalized error of the Hayashi-Yoshida estimator to a discrete process, 
which is locally a uniformly ergodic homogeneous Markov chain. Thus, the problem 
can be solved locally, and because we assume that the volatility of assets is continuous, 
the error of approximation between the local Markov structure and the real structure 
of the normalized error vanishes asymptotically. This technique is not problem-specific, 
and it can very much be applied to other estimators dealing with temporal data. 

The paper is organized as follows. We introduce the HBT model in Section 2. 
Examples covered by this model are given in Section 3. The main theorem of this 
work, the limit distribution of the normalized error is given in Section 4. Estimators 
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of the asymptotic bias and variance are provided in Section 5. We carry out numerical 
simulations in Section 6 to corroborate the theory. Proofs are developed in Appendix. 


2 Definition of the HBT model 

We hrst introduce the model in 1-dimension. We assume that for any positive integer 
i, Tj+i is the next arrival time (after r*) that corresponds to an actual change of price. 
In particular, several trades can occur at the same price Z^-. between r* and Tj+i, but 
no trade can occur with a price different than before Tj+i. We also assume that Xt 
is the efficient (log) price of the security of interest. In addition, we assume that the 
observations are noisy and that we observe Z^-. := -|- e^-. where the microstructure 

noise can be expressed as a known function of the observed prices Zq, ..., Z^-.. As an 
example, Robert and Rosenbaum (2012) showed in (2.3) in p. 5 that the model with 
uncertainty zones can be written with that noise structure if we assume that we know 
the friction parameter rj. Finally, we dehne a > 0 as the tick size, and we assume that 
the observed price lays on the tick grid, i.e. there exists positive integers rrij such 
that Zr^ := niia. 

Empirically, no economical model based on rational behaviors of agents on the stock 
markets, that shed light on the relationship between the efficient return and time 

before the next price change Ar* = Tj — rj_i, has won unanimous support. When 
arrival times are independent of the asset price, it follows directly from the continuous 
Ito-assumption that the dependence structure is such that the return AX^-^ is a function 
of Atj. The longer we wait, the bigger the variance of the return is expected to be. In 
this paper, we take the opposite point of view by building a model in which r* is dehned 
as a function of the efficient price path. For that purpose, we dehne the observation 
time process that will drive the observation times. We also dehne the down process 
dt{s) and the up process Ut{s). Note that for any t > 0, we assume that dt and Ut are 
functions on M"*". We also assume that the down process takes only negative values and 
that the up process takes only positive values. A new observation time will be generated 
whenever one of those two processes is hit by the increment of the observation time 
process. Then, the increment of the observation time process will start again from 0, 
and the next observation time will be generated whenever it hits the up or the down 
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process. Figure illustrates the HBT model. Formally, we define tq := 0 and for any 
positive integer i as 

Ti := inf |f > Ti-i : ^ [dt {t - n-i) n-i) ] (5) 

where AY\^a,h] '■= Yb — Ya- Note that if the observation time process is equal to the 
price process Xt itself, then the price will go up (respectively go down) whenever it hits 
the up process (down process). Note also that if the time process, the up process and 
the down process are independent of the efficient price process, then the arrival times 
are independent of the efficient price process. We assume that the two-dimensional 
process {Xt.xf'^) is an Ito-process. Section 3.1 provides examples of the literature 
identifying the observation time process, the down process and the up process. 


Generalizing to two dimensions is straightforward. We define xf'^'^ for A; = 1, 2 to be 
the observation time process associated with the fcth price process , the up process, 
the down process, and the arrival times generated by (5). We also define 
the four dimensional process Y) := (Xy\ xj^\ and assume Yt follows an 

Ito-process with volatility 
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In particular, we have dYt = fitdt -|- atdWt, where Wt is a four dimensional standard 
Brownian motion (for i = 1,..., 4 and j = 1,..., 4 such that i ^ j, is independent 
of If we set (t = o'tO'^, then the integrated covariance (or quadratic covariation) 

process is given by {Y, Y)t = /J C,sds. Let pt be the associated correlation process of 
Y), i.e. for z = 1,... ,4 and j = 1,... ,4 we set = Ct’'^(Ct’*)~^- Finally, it is useful 
sometimes to see Yt as a four dimensional vector expressed as in equations ([^ and (|^. 
For /c = 1,... ,4 we define the volatility of the kih. process as := (Cf’^)G we can 
thus express as 

= pf^dt + aPdB[^'^ 


where is a standard Brownian motion, which typically depends on for I = 

1,...,4. 
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3 Examples 


We insist on the fact that estimators of covariance and associated asymptotic variance 
given in this paper don’t reqnire any knowledge of the structure of the observation time 
process, the up process and the down process. Nonetheless, for financial and economic 
interpretation purposes, the reader might be interested in getting an idea on how those 
processes behave in practice. We provide in this section several examples from the 
literature as well as possible extensions of the model with uncertainty zones of Robert 
and Rosenbaum (2011) that can be expressed as HBT models. 


3.1 Endogenous models contained in the HBT class 


Example 1. (hitting constant boundaries) The simplest endogenous semi-parametric 
model we can think of is a model where the time process is equal to the price 
process and times are generated by hitting a constant barrier. Formally, it means 
that there exists a two-dimensional parameter (6*„, 0^) such that the up process is equal 
to 6u and the down process is equal to 6^- We don’t assume noise in that model. 


Example 2. (hitting constant boundaries of the tick size) One issue with Example 
is that the efficient price X^--, which is observed because no microstructure noise is 
assumed in the model, is not necessarily a modulo of the tick size a if 9u and 9d are 
not multiples of a. To make Example 3. 1| feasible in practice, we assume here that the 
constant barriers 9^ and 6*^ are respectively equal to the tick size a and its additive 
inverse —a. We also assume that X := X^.. 


Example 3. (hitting constant boundaries of the jump size) The issue with Example]^ is 
that the absolute jump size of the observed price is a. On the contrary, in practice 
the absolute jump size can actually be bigger than the tick size a. In the notation 
of Robert and Rosenbaum (2011), for any positive integer i, we introduce a discrete 
variables Lj which corresponds to the observed price jump’s tick number between r* 
and Tj+i, with Lj > 1. We assume that Li is bounded. The arrival times are defined 
recursively as tq := 0 and for any positive integer i as 


Ti 


:= inf |f > Ti-i : X* = X^,_^ - Li_ia or X* = X^,_j 
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We assume that Lj are IID and independent of the other quantities. We finally assume 
that := Xt-.. The up and down processes are piecewise constant in t and constant 
in s, defined for any s > 0 as 

dt{s) = -Li_ia for t e (rj_i,ri] 
ut{s) = Li_ia for t e (rj_i,ri] 

Example 4. (model with uncertainty zones) We go one step further than Example 
and introduce now the model with uncertainty zones of Robert and Rosenbaum (2011). 
In a frictionless market, we can assume that a trade with change of price will occur 
whenever the efficient price process crosses one of the mid-tick values Zt-._^ | or 

~ f • fhat case, if the efficient price process hits the former value, we would 
observe an increment of the observed price Z^-^ = Zr^_^ -f a and if it hits the former 
value, we would observe a decrement = Zr^_^ — ol. There are two reasons why in 
practice such a frictionless model is too simplistic. The first reason is that the absolute 
value of the increment (or the decrement) of the observed price can be bigger than the 
tick size a and was already pointed out in Example We will thus keep the notation 
Li in this example. The second reason is that the frictions induce that the transaction 
will not exactly occur when the efficient process is equal to the mid-tick values. For this 
purpose in the notation of Robert and Rosenbaum (2012), let 0 < r/ < 1 be a parameter 
that quantifies the aversion to price changes of the market participants. If we let 
be the value of Xt rounded to the nearest multiple of a, the sampling times are defined 
recursively as tq := 0 and for any positive integer i as 

Ti := inf \t > ri_i : Xt = X^^\ - a(Li_i - ]^+ri) oi Xt = X^^\ + a(Li_i “ ^ } 

The observed price is equal to the rounded efficient price Z^-- := xl^\ The time process 
xf'^ is again equal to the price process Xt itself in this model. The up and down 
processes are piecewise constant in t and constant in s, defined for any s > 0 as 

dt{s) = -Li_ial^Xr^_^<Xr,_^} - ( 2 r 7 + Li_i - 1) for t e (r^.i, n] 

ut{s) = + {2r] + Li_i - 1) for t e {n-i, n] 

where 1a is the indicator function of A. Note that in the case where t] = we are back 
to Example 


Example 5. (times generated by hitting an irregular grid model) The fourth model 
we are looking at is called times generated by hitting an irregular grid model. We 
follow the notation of Fukasawa and Rosenbaum (2012) and consider the irregular grid 
Q = {Pk]k&, with pk < Pk+i- We set Tq = 0 and for i > 1 

Ti = inf |f > Ti_i : XteG - 

where Q — is the set obtained by removing from Q. We can rewrite it 

as an element of the HBT model where the time process is equal to the price process, 
and for all s > 0 the up and down processes are defined as 

dt{s) = pk-i -Pk for t e {Ti-i,Ti] 

Ut{s) = Pk+l - Pk for t e 

where k is the (random) index such that pk = 

Example 6. (structural autoregressive conditional duration model) There have been 
several drafts for this model. We follow here a former version (Renault et ah (2009)), 
because we can directly express it as an element of the HBT modeQ In the structural 
autoregressive conditional duration model, the time when the next event occurs is 
given by To = 0 and for i > 0 

Ti = inf |f > Ti-i : At - = dr,_t or At - = c^,_j | (6) 

where At is a standard Brownian motion (not necessarily independent of Xt). Expressed 
as an element of the HBT model, we have that the time process is equal to the 
Brownian motion At and for all s > 0 

dt{s) = dri_t for t e 
Ut{s) = Cr,_i for t e 

^Generating the sampling times (j^ of the HBT model as a first hitting-time of a unique barrier 
instead of the first hitting time of one of two barriers as in the latter version of Renault et al. (2014) 
wouldn’t change much the proofs of this paper, but we chose the two-boundaries setting because it 
seems more natural if interpretation of time processes, up processes and down processes is needed. 
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3.2 Possible extensions of the model with uncertainty zones 

The model with uncertainty zones of Robert and Rosenbaum (2011) introduced in 
Example which is semi-parametric, assumes that the observed price is the efficient 
price rounded to the nearest tick value and thus the noise is equal to e* := 

a{^—ri) if the last trade increased the price and e* := —a(|—r^) if the last trade decreased 
the price. In particular, the noise is auto-correlated and correlated to the efficient 
price. Because of this specific noise distribution, it is directly possible to estimate the 
underlying friction parameter r] without any data pre-processing such as preaveraging 
(see Robert and Rosenbaum (2012)). We believe the model with uncertainty zones 
is a very interesting starting point, because all the endogenous and noise structure 
of the model is reduced to the estimation of the 1-dimensional friction parameter r]. 
Nevertheless, as this semi-parametric model wants to be the simplest, it suffers from 
several issues. We will investigate two of them in the following. 

First, the model doesn’t allow for asymmetric information between the buyers and 
the sellers. Define ri~^ and 7]~, which are respectively the aversion to a positive price 
change and a negative price change. As a positive price change means that a buyer 
decided to put an order at the best ask price and a negative price change corresponds 
to a seller that puts an order at the best bid price (if we assume that cancel and repost 
orders are not the reason why the price changed), the difference — ri~ can be seen 
as a measure of information asymmetry. We define tq := 0 and recursively for i any 
positive integer 

Ti := inf |f > Ti-i : Xt = - a(Li -^ + r]~) or ^ 

Note that the HBT class contains this model and that it can be directly fitted if we 
slightly modify fj in Robert and Rosenbaum (2012) to estimate and ri~. One possible 
application would be to build a test of asymmetric information t]~^ := t]~ . This is beyond 
the scope of this paper. 

One other issue is that the authors don’t do any model checking in their work. 
According to their empirical work (see pp. 359-361 of Robert and Rosenbaum (2011)), 
the estimated values for rj are stable accross days for the ten French assets tested. 
Stability of rj favors their model but by doing so, the model doesn’t allow any other 
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structure than the full-endogeneity for the sampling times. Even if the true structure 
of sampling times is (mostly) independent of the asset price, we will still estimate an 
r] that will be stable across days. If we allow the time process to be different from 
the price process itself, we can estimate the correlation between them and see 
how endogenous the sampling times are (the bigger is, the more endogenous the 
sampling times are). We would need to add more general microstructure noise in the 
model, and thus this is left for further work. 

4 Main result 

4.1 Assumptions and Theorem 

Without loss of generality, we fix the horizon time T := 1, and we consider [0,1] to 
represent the course of an economic event, such as a trading day. We first introduce the 
definition of stable convergence, which is a little bit stronger than usual convergence 
in distribution and needed for statistical purposes of inference, such as the prediction 
value of the high-frequency covariance and the construction of a confidence interval at 
a given conhdence level. 

Definition 1. We suppose that the random processes Yt, fit and at are adapted to a 
filtration {J-t)- Let Zn be a sequence of J^i-measurable random variables. We say that 
Zn converges stably in distribution to Z as u —)■ cx) if Z is measurable with respect to 
an extension of J^i so that for all and for all bounded continuou^ functions /, 

E [1a/ (^n)] -t E [1a/ {Z)] as n -)■ cx). 

In the setting of Section 2, the target of inference, the integrated covariation, can 
be written for all f G [0,1] as 

Jo 

^Note that the continuity of / refers to continuity with respect to the Skorokhod topology of D[0,1]. 
Nevertheless, we can also use continuity given by the sup-norm, because all our limits are in C[0,1]. 
One can look at Chapter VI of Jacod and Shiryaev (2003) as a reference. For further definition of 
stable convergence, one can look at Renyi (1963), Aldous and Eagleson (1978), Chapter 3 (p. 56) of 
Hall and Heyde (1980), Rootzen (1980), and Section 2 (pp. 169-170) of Jacod and Protter (1998). 
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We are providing now the asymptotics. We want to make the number of observations 
go to inhnity asymptotically. The idea is to scale and thus keep the structure that 
drives the next return and the next observation time, while making the tick size vanish 
(and thus the number of observations explode on [0,1]). Formally, we let the tick size 
a > 0 and we define the observation times Tq, := such that for fc = 1, 2 we 

have Tq^ := 0 and for i any positive integer 




T- := inf \t> ^ 




We define the HY estimator when the tick size is equal to a as 


{A'wYp))"'' := ^ 


i,ct j,a I, 1'i—l,c 


'( 2 ) 






.r(Yn[r)Y n 


( 2 ) 


J,oc 


)^ 0 }' 


( 7 ) 


We now give the assumptions needed to prove the central limit theorem of ([^. We need 
to introduce some definitions for this purpose. In view of the different models introduced 
in Section 3, there are three different possible assumptions regarding the correlation 
between the time processes and the price processes Xt. The first possibility is that 
they can be equal for all 0 < f < T. In this case we define as the smallest eigen¬ 
value of The second scenario is that for one fc G 1, 2 we have xj^^ := xf'^\ 

but the other time process is different from its associated price process. In that case, we 
define the smallest eigen-value of 3 ’Ijllfc+I} • The third possible setting 

is that the time process is different from its associated asset price for both assets, and we 
let Af “ the smallest eigen-value of at in that case. Assumption (Al) provides conditions 
on the price processes x[^'^ and x[^\ the time processes and as well as their 
covariance matrix a*. There are two types of assumptions in (Al). First, we want to get 
rid of the drift in the proofs, and this will be done using condition (Al) together with 
the Girsanov theorem and local arguments (see e.g. pp.158-161 in Mykland and Zhang 
(2012)). This is a very standard assumption in the literature of financial econometrics. 
Furthermore, we assume that the covariance matrix at is continuous. 


Assumption (Al). The drift fit^ fhe volatility matrix at and the (four dimensional) 

Brownian motion Wt are adapted to a filtration (XJ. Also, fit is integrable and locally 

bounded. Furthermore, at is continuous. Finally, we assume that inf AF“ > 0 a.s. 

te(o,i] 
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Remark 1. (robustness to jumps in volatility) The proof techniques, holding the volatil¬ 
ity constant on small blocks, require the "continuity of volatility". This is the same 
strategy as in Mykland and Zhang (2009) and Mykland (2012) where the volatility pro¬ 
cess follows a continuous ltd process. Nonetheless, following the same line of reasoning 
as for the proof of Remark we can add a finite number of jumps in the volatility 
matrix. The proof of Theorem will break in the case of infinite number of jumps in 
CTf 

The following condition roughly assumes that both time processes can’t be equal to 
each other, even on a very small time interval. Specifically, we will assume that there 
is a constant strictly smaller than 1 such that the module of the instantaneous high- 
frequency correlation can’t be bigger than this constant. In practice, assumption 
(^42) is harmless. 

Assumption (A2). For all t G [0,1] we have 


Pt^ 


, r 3,4 3,41 

^ [P- )P+ J) 


( 8 ) 


where max(| |, | |) < 1. 


The next assumption deals with the down process dt and the up process Ut- It is 
clear that dt and Ut have to be known with information at time t, which is why we 
assume that they are adapted to {iFt)- The rest of assumption (A3) is very technical 
and we only try to be as general as we can with respect to the proof techniques we will 
use. The reader should understand Assumption (A3) as “assume the worst dependence 
structure possible between the return AX^-. and the time increment Ar*, knowing that 
they follow the HBT model”. We insist once again on the fact that we only make the 
dependence structure as bad as we can in our model so that we can investigate how 
biased the HY estimator can be in practice, and how much the estimates of the variance 
assuming no endogeneity are wrong. 


Assumption (A3). For both assets k = 1,2, define the couple of the down process 
and the up process := {d!'t‘\'^f^) let gt := {g‘i^\gf'’)- We assume that 

g{k) . ]^+ X M+) 
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is adapted to {J-t)- Moreover, there exists two non-random constants ^ < g < g^ snch 
that a.s. for any t G [0,1] and for any s > 0 

g- < min(-df^(s),'uf^(s)) < max(-df^(s), nf^(s)) < g^ (9) 

Fnrthermore, there exists non-random constants K > 0 and d > 1/2 such that a.s. 

^s>K,gt{s)=gt{K), ( 10 ) 


Vt > 0, gt is differentiable and Vs > 0, max (|(d|^^)'(s)|, |(m|^^)^(s)|) < K, (11) 


V (n, v) e [0,1]^ s.t. 0 < M < n, - 5f„||oo < K\v - m|'^, 
where ||(/i,/ 2 )||oo = supmax(|/i (w) |, I /2 (w) |). 

w>0 

Remark 2. Consider the space C of constants defined in Assumption (A3) 


( 12 ) 


C : = 


',g^,K,d) s.t. 0 < g < g^ , K > 0 , d > . 


For any c G C, we define Q{c) to be the functional subspace of M’*' —)• (M’*' —)■ M x M’*')^ 
such that '^g & Q, g satisfies (§. ( [To| , ( [TT| and ( p!^ . When there is no room for 
confusion, we use Q. Assumption (A3) is equivalent to 

3c G C s.t. 'it G [0,1] , G G{c). 

Remark 3. The advised reader will have noticed that Example Example Example 
and Example where time processes are piecewise-constant and may depend on n, 
don’t follow Assumption (A3). The adaptation of Theoremproofs in those examples 


is discussed in Appendix 8A We have made the choice not to state more general 
conditions to keep tractability of Assumption (A3). 

The last assumption is only technical, and also appears in the literature (Mykland 
and Zhang (2012), Li et ah (2014)). 

Assumption (A4). The filtration (J3) is generated by finitely many Brownian mo¬ 
tions. 
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We can now state the main theorem. 

Theorem 1. Assume (^1) — (^4). Then, there exist processes ABt and AVt adapted 
to {Bt) such that stably in law as the tick size a —)■ 0, 

^ ABt + {AVsf/^dZ,, (13) 

where Zt is a Brownian motion independent of the underlying a-field. The asymptotic 
bias ABt and the asymptotic variance AVt are defined in Section 4.3 and estimated in 
Section 5. 


Remark 4. (path-bias) Note that the asymptotic bias term ABt on the right-hand side 
of (13) doesn’t mean that the Hayashi-Yoshida estimator is biased, but rather path- 
biased. The latter is a weaker statement which means that once we have seen a path, 
there is a bias for the HY estimator on this specihc path of value ABt. In practice, 
we only get to see one path and thus bias and path-bias can be confused easily. When 
doing simulations, we can observe many paths and the reader should keep in mind that 
the path-bias will be different for each path. In addition, note that if we assume that at 
is bounded and bounded away from 0 on [0,T], there is no bias in Theorem because 
E[ABt\ = 0. 

Remark 5. (convergence rate) At hrst glance, the convergence rate a~^ looks different 
from the optimal rate of convergence obtain in the no-endogeneity case. This 

is merely a change of perspective because we are looking from the tick size point-of- 
view. Actually, if for fc = 1, 2 we dehne Nt^^ as the number of observations before t of 


the kth asset and the sum of observations of both processes Nt^J := Nt]l Ntfi^, we 
have that Nt^J is exactly of order Op{a~‘^). Thus, if we dehne the expected number of 
observations n := , we obtain the optimal rate of convergence in (13). 

Remark 6. (robustness to jumps in price processes) We assume that we add a jump 
component to the price process 


r(2) 


dxf ^ = /if + af ^ + djf ^ (14) 

for k = 1,2, where Jt denotes a 2-dimensional hnite activity jump process and dj[^'^ 
is either zero (no jump) or a real number indicating the size of the jump at time t. 
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We follow exactly the setting of p. 2 in Andersen et al. (2012). We assume that 
Jt is a general Poisson process independent of the other quantities. Under the same 
assumptions the conclusion of Theorem remains valid. The proof can be found in 
Appendix 8.6 The inhnitely many jumps case is complex and beyond the scope of this 
paper. This was already the case in the 1-dimensional case (see Remark 4 in p. 586 of 
Li et al. (2014)). 


Remark 7. (grid on the original non-log scale) Theorem covers the particular case 
where W corresponds to the log-price and observations are obtained when the price on 
the original scale hits a boundary. This can be done by a reparametrization of by 
:= (-exp(-4^^),exp(Mf^)). 


Remark 8. (arbitrary number of assets) The authors chose for simplicity to work only 
with two assets, but they conjecture that this result would stay true for an arbitrary 
number of assets, and that our proofs would adapt to show it, at the cost of more 
involved notations and dehnitions. 


4.2 Definition of the bias-corrected HY estimator 

Assume that we have a consistent estimatoi0 of the bias ABt^a '■= aABt. Such 

estimator will be provided in Section 5. We define the new estimator of 

high-frequency covariance as the estimate obtained when removing the bias estimate 
ABt^a from the Hayashi-Yoshida estimator 

(15) 

With the bias-corrected estimator (XP), we get rid of the asymptotic bias and 

keep the same asymptotic variance as we can see in the following corollary. 

Corollary 2. Assume (Al) — (A4). Then, stably in law as a — ?• 0, 

0-1 ^ {AVsf‘^dZ,. ( 16 ) 

^ABt^a is consistent means that a~^ABt^a = oT ^ABt^a + Op(l) 
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4.3 Computation of the theoretical asymptotic bias and asymp¬ 
totic variance 


We warn the reader interested in implementing the bias-corrected estimator that this 
section is highly technical and we advise her to go directly to Section and refer to 
this section only for the dehnitions. On the contrary, if the reader wants to nnderstand 
the main ideas of the proofs, she shonld take this section as a reference. We also want 
to emphasize on the fact that the theoretical values of asymptotic bias and asymptotic 
variance found at the end of this section are rather abstract and don’t shed easily light 
on how the change of parameters at and gt in the model would influence the asymptotic 
bias and asymptotic variance. The main purpose of this paper is that we don’t need to 
know the theoretical values in order to compute the estimators in Section 


We need to introduce some dehnitions in order to compute the theoretical asymp¬ 
totic bias ABt and the asymptotic variance term AVt- We hrst need to rewrite the HY 
estimator Q in a different way. For any positive integer i, consider the iih. sampling 
time of the hrst asset We dehne two random times, TjTi^Q and which 

are functions of observation times of the second asset {Tja}i>o, and 

which correspond respectively to the closest sampling time of the second asset that is 
strictly smaller than and the closest sampling time of the second asset that is 

(not necessarily strictly) bigger than ^ as 

= 0 , 

= max{rj2 : rjS < for * > 2, 

• r (2) 

= 


Tr 


0,a 


T, 


2—1,a 

7 -+ 

' 2—1,a 


-( 2 ) 


> XX} for i > 1. 


We consider AX 


( 2 ) 


the increment of the second asset between q, and 


AX 


( 2 ) ._ 


= AX, 


( 2 ) 


r“ ,T+ 1 

1 — 1,0:’ 1,0 J 


Rearranging the terms in ([^ gives us (except for a few terms at the edge) 

(X^(2)),,„= V AX^AX^!)^. 


(17) 

(18) 

(19) 

( 20 ) 

( 21 ) 




Connoisseurs will have noticed that 


2—1,a 


is not a Yj-stopping time, which will not be a problem 


in the proofs 
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The representation in (21) is very useful in the sense that it gives a natural order 


between the terms in the sum. Nevertheless, any term of this sum is a priori correlated 


with the other terms. We will rearrange once again the terms in (21), so that each term 


is only correlated with the previous and the next term of the sum. In this case, we 
say that they are 1-correlated. For this purpose, we need to introduce some notation. 
We remind the reader that Tq, is the two-dimensional vector of sampling times, where 
for each k = 1,2 the kth component is equal to the sequence of sampling times 
associated with the kih. asset. We will construct a subsequence of that also 
depends on the observation times of the second asset To , and will be such that we 


can write the Hayashi-Yoshida estimator as a 1-correlated sum similar to (21), except 
the new sampling times will replace the original observation times The new 
sampling times are obtained using the following algorithm. We define Tq^ := 


and recursively for i any nonnegative integer 

:= min • there exists j G N such that 




1C 


2 + 1 , 0 ; 


<+n. 

' 2,0 — ' J,0 ^ ' 22,0 J 


( 22 ) 


In words, if we sit at the observation time of the first asset, we wait first to hit 


2,0 


an observation time of the second asset, and we then choose the next strictly bigger 
observation time of the first asset. In analogy with ( [1^ , ( |I^ , (19) and (20), we define 
the following times 


AY 


IC- 

■■= 0, 

(23) 

1C- 

'T- ’ 

' 2—1,0 

:= max{rfj : rfj < tI%} lor t > 2 

(24) 

,1C: + 
2 —1,0; 

:= min{rj^^2 : rfj > for i > 1, 

(25) 

-(2) 

1,0 

;= AyJ^i^c- ic,+, for i > 1. 

(26) 


First, observe that, except for maybe a few terms at the edge, we can rewrite (21) as 

= Y W!AW,_.+. 


(27) 


1C,+ 


<t 


Also, we define the following compensated increments of the HY estimator 

e/ds. (28) 


= AY^J^AY 


1 , 0 . 


( 2 ) 

ic,-,+ 


r.lC 

z,a 


tIC 

U-l.a 
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Note that they are compensated in the sense that they are centered (if we decompose 
AX'-ic _ + into a left (—), a central and a right (+) part and condition the expectation, 
this is straightforward to show). Similarly, we can show that they are 1-correlated. 


The idea of the proof is the following. If we consider the volatility matrix at and 
the grid function gt to be constant over time, we can express the conditional returns of 
the normalized error of HY as a homogeneous Markov chain (of order 1), show that the 
Markov chain is uniformly ergodic and thus use results in the limit theory of Markov 
chains (see, e.g., Meyn and Tweedie (2009)) to show that it has a stationary distribution. 
Then, we prove that we can approximate locally the returns of the normalized error 
when the volatility matrix and grid function are not constant by the returns when 
holding them constant on a small block. Finally, using limit theory techniques developed 
in Mykland and Zhang (2012) together with standard probability results of conditional 
distribution (see, e.g., Breiman (1992)), we can bound uniformly in time the error of 
the returns when holding the volatility matrix and grid function constant. 


Based on the dehnitions introduced in Appendix |8.1| , we can dehne the instantaneous 
variance of the normalized HY estimate’s error (29), which depends on the volatility 
matrix a and the grid g. Similarly, we also define the instantaneous covariance be¬ 
tween the normalized HY’s error and the hrst asset price (30), and the instantaneous 
covariance between the error and the second asset price (31). Finally, we dehne the 
instantaneous 1-correlated time, which is the approximation of ]E.,-ic , where if r 

is a (J^t)-stopping time, [Y] is dehned as the conditional distribution of Y given Jv. 


ij^^{a,g,x,u) 

:= E[iV2 + 2iV2iV3], 

(29) 

ij^^\a,g,x,u) 

:= E[N2AX^^^a], 

(30) 


:= EfWAX^l.,^], 

f'2 

(31) 

^^{a,g,x,u) 

:= E[Ar,^^]. 

(32) 


Remark 9. The reader might expect Ni in lieu of N 2 in (29), (30), (31) and (32). 
Actually, we cannot use Ni directly from the dehnition because the corresponding time 
Tq ’ =0. We would need to set it to —u to alter the dehnition of (29), (30), (31) and 
(32), which we have chosen not to do for the sake of clarity. 
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Set Zq := (x, u) and for any positive integer i 


:= (AX 


(4) 


1C,- -icn 


1C 


-IC,- 


r, 


)■ 


(33) 


For any nonnegative integer i, we consider TTi{a,g,x,u) the distribntion of Zj. We 
also introdnce the notation Il{a,g,x,u) := {ni{a,g,x,u)}i>o. By the strong Markov 
property of Brownian motion, we can show that Zj is a homogeneons Markov chain (of 
order 1) on the state space Sg. In the following lemma, we show that there exists a 
stationary distribution of ni{a,g,x,u). 

Lemma 3. Let c := {g~,g~^,K,d) be a four-dimensional vector such that c & C and 
consider a a constant volatility matrix such that > 0 and g & Q{c) a constant grid. 
Then, there exists a stationary distribution 7i{d,g). 


The proof of Lemmacan be found in the Appendix (proof of Lemma 14). The next 
dehnition is the average (regarding the stationary distributions) of the instantaneous 
variance, covariances and 1-correlated time. For any 6 G {AV, ACl, AC2, r}, 

0^(5-,^):= / {d,g,y,v)dTT{d,g){y,v). 

We introduce the notation := (j)^ (cXs, gs) and consider the following quantities needed 
to compute the asymptotic bias and variance. 

fcP := (34) 


We express now AVg the quantity integrated to obtain the asymptotic variance. 

AV. := - (tp + fcp),),yp) {v.y' 


(36) 


- (pi^) 

The asymptotic bias is defined as AB. AB^dX^ + Jl AB^dX^ where 


0 

-1 


abw := kP-kypyamyrny', 


ABy := V.A. 
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(37) 

(38) 



Remark 10. (asymptotic bias) Looking at the expressions for and ABf‘\ one can 

be tempted to think that because of the (l —^ term in the bias will increase 
drastically when both assets are highly correlated. In this case, the reader should keep 
in mind that the second term of AB^P ^ when integrated with respect to and 

ABl , when integrated with respect to Xs , will be roughly of the same magnitude, 
with opposite signs, and thus there is no explosion of asymptotic bias. We chose the 
above asymptotic bias’ representation because it is straightforward to build estimators 
from it. We can also express the asymptotic bias differently. For this purpose, we can 
rewrite the log-price process as 

dAf = aPiB™, 

where Bj^^ and are independent Brownian motions. Let 

dXl’^ = (1 - (39) 

be the part of that is not correlated with x[^\ We can express the asymptotic 
bias as ABt = AB^^"’dXs^^ -|- AB^f’dB]'^. In this case, AB^^^ = k^P and 

ABf = lim 

n^oo 

where M” is dehned in the proofs. We can show that this limit exists, and does not 
explode when both assets are highly correlated. 


5 Estimation of the bias and variance 


We need to introduce some new notations. We recall that is the number of obser¬ 
vations corresponding to the hrst asset before 1 and we also dehne the number of 
1-correlated observations before 1, i.e. := max{i G N s.t. < 1}. In practice, 
the hrst step is to transform the returns of the hrst asset 


into 1-correlated returns 


i,a. 

{(AA;S.A,;y)}fjS 
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using algorithm (22). Then, for each asset, we will chop the data into Bn blocks and 

- ---(1) -^(2) 

on each block i = 1,..., Bn we will estimate AVi^a, AB^^^ and AB^^, pretending that 
the volatility matrix at and grid gt are block-constant. 

Because there is asynchronicity in the observation times, the blocks of each asset 
are not exactly equal. Let hn be the block size. For the first asset, we consider block 
1^^^ := [0, block 2^^^ := „], etc. For the second asset, we let block := 

b^ock 2(2) := etc. In the following, we will say j G block i(b 

when Tj]l G block i(^). Similarly, we say j G block *(2) when G block *(2). Finally, 

we define j G block i if j G {(i — l)h„ -|- 1,... ,ihn}- First, we estimate the volatility 
of both assets using the corrected estimator in Li et ah (2014). To do this, we need to 
define an estimate of the spot volatility on each block for each asset /c = 1,2 by 


a, 


(fc) 

i^a 


(k) 


jGblock 


Then, we estimate the asymptotic bias of the volatility via 


(fc) 

ABa,^ = 


r(^)'l2 




j£block 

We obtain the bias-corrected estimators of volatility on each block: 


Ak) ~(k) 

a- = a 

i,a i,a 


ABa 


(k) 


Then, we estimate the correlation between both assets using the naive HY estimator 

1 


- 1,2 

PLa = 


dl)-(2) 


E 


jgblock i 


( 1 ) A ■v( 2 ) 

i,a yj'.o: 


We then build an estimator of the compensated increments of the HY estimator, fol¬ 


lowing the definition in (28), 


N, 




i,a 


-(2^ 

'^i,a 


;^l,2 


The next step is to estimate the instantaneous variance (29), both instantaneous covari¬ 


ances (30) and (31) and the instantaneous 1-correlated time (32) on each block. This 
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is done by taking the sample average of the corresponding estimated quantities. Note 
that we don’t directly estimate and , but rather a scaling version 

of them, i.e. , anXp'^'^^ i an'ip^^^ and an'ip'^■ In practice, we can always assume 

Q!„ := 1 by scaling by the tick size, and thus we match the dehnitions of the following 
estimators with (29)-(32). For the sake of simplicity, we assume that the number of 
1-correlated observations of the last block Bn is also In practice, this will be most 
likely different from and thus the denominator of (40)-(43) will have to be changed 
so that it is equal to the number of 1-correlated observations in this last block. The 
estimates are given by 


6^^ 

V^2,0 

•— hn^ E Ea + 

(40) 


jGblock 2 


W2,0 

:= E 

(41) 


j-Gblock 2 


W2,0 

- ft;' E WAA'py.,, 

(42) 


j,a 

jGblock 2 


01,a 

:= K" E 

(43) 


jGblock 2 



We estimate now the quantities ( |34[ ) and (35) as 


2 , 0 ; 




(2)^-2;;AC2 _ 


r2,or2,0 ) 


-1 


(44) 

(45) 


We follow (37) and (38) to estimate the bias integrated terms and AB'f'^ on each 

block 


AB 

AB 


2,0 

( 2 ) 

2,0 


:= k) 


( 1 ) 


PT-i,2^(2) 

^ 2,0 r 2 , 0 ^ 2,0 V ^ 2,0 / ’ 


:= k 


1,-L 


For the variance term AVg, we decide not to use the direct dehnition in (36) because it 
can provide negative estimates. Instead, we will be using the following estimator 


klWa : = 


jGblock 2 

(2) _ -^(2) 

.1C 

{i—l)hn,OL 


( 1 ) 

AC 

ihfi , q : 


-X 


( 1 ) 


■1C , 

(i-l)hn,a 


^i,a[\^lC,+ ^1C,+ ) Pi,a^i,a\^i,a) 

ihn ,ct / • ^ T 


( 1 ) 

1C 


-X 


( 1 ) 


1C )) 

(i—l)hn,a 
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We define the final estimators of asymptotic bias and asymptotic variance as 

Bn 


AB, := 


i=l 

Bn 


,1C 

' ihji ,a 


( 1 ) 




( 2 ) 


^(2) 


_ 

1C,+ ^ 1C,+ 


(i—l)hn,Q: 


). ( 46 ) 


:= 5^27 


1C 

*,« \'ihn,a 


- i 

'{i-l)hn,a) ■ 


(47) 


Z=1 


As a corollary of Theorem 1, we obtain the following result, which states the consistency 
of (46) and (47). 

Corollary 4. There exists a choice of the block size such that when a —)■ 0, we 
have 


a ^ABc 

a-^^r 


A ABi, 

7-1 


-)■ 


AVAs. 


(48) 

(49) 


In particular, in view of Corollary 2, the bias-corrected estimator := 

^ — ABa is such that 




AV 


1/2 


—h A/'(0,1). 


(50) 


Remark 11. (exchanging and xf’'^) When estimating the asymptotic bias and the 
asymptotic variance, we considered one specihc asset to be and the other one to 
be x]^\ We could exchange X^^'^ and xl‘^\ and hnd new estimators AB^ and AVa 
according to the previous dehnitions. One could then take (respectively 

AVa+AVt,a ^ final estimators of asymptotic bias (asymptotic variance). 

Remark 12. (optimal block size) In practice, the optimal block size hn is not straight¬ 
forward to choose. On the one hand, hn should be as small as possible so that the 
volatility matrix at and the grid gt are almost constant on each block, and thus (40)- 
(43) are less biased. On the other hand, we need as many observations as we can on 
each block, so that the variance of approximations (40)-(43) is not too big. We are 
facing here the usual bias-variance tradeoff. 


^the exact assumptions on hn can be found in the proofs of Theorem 1 
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6 Numerical simulations 


We consider four different settings in this part. We describe here the first one. We 
assume the same setting as the toy model described in Example [T| in two dimensions. 
Thus, there exists a four-dimensional parameter 6 := such that for 

any f > 0 and any s > 0, := 0u \ d[^\s) := 6ll\ := 6u^ and d[^\s) := 

We assume that the two-dimensional price process has a null-drift. Also, we 

assume that the volatility of the first process is := where := 0.016 and the 
volatility of the second process aj:^^ := d^^^ where d^^^ := 0.02, and that the correlation 
between both assets is := 0.2. We set 0 := (0.0007, 0.0001, 0.0006, O.OOOl). Accord¬ 
ing to this rule, a change of price occurs whenever the price of the first (respectively 
second) asset increases by 0.07% (0.06%) or decreases by 0.01% (0.01%). Finally, we 
assume that the price processes (xj^\ xj^^) and the time processes are 

equal. 

The second setting is similar to the first setting, except that we assume now a 
stochastic volatility Heston model. Specifically, we assume that 

:= p^^^dt + at^dBi^\ 

where the constant high-frequency covariance between and B^^^ is fixed to and 
(.b|^\ Hj^^) are uncorrelated with each other. We choose to work with drift := 

(0.03,0.02), and to add leverage effect (p^^\p^^^) are selected to be (—0.8,—0.7). Fi¬ 
nally, {K!d\Kp‘^) := (4.5,5.5), the volatility of volatility := (0.4, 0.5), and the 

volatility starting values (cto^^cTo^^) := {d^^\ . 

We consider now the third setting, which goes one step further than the previous set¬ 
ting. We assume a jump-diffusion model for both the price and the volatility. Formally, 
we assume that 

:= + erf ^ \ 

d(afV := - (afV)dt + ^ + djf\ 

where the jumps {Jt^\ Jt^\ follow a 4-dimensional Poisson process with in¬ 
tensity (A^^\ := (12,11,10,9). The jump sizes are taken to be 1 or —1 
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with probability | for price processes, and 0.0001 or —0.0001 with half-probability for 
volatility processes. 

In the fourth setting, we consider another model of arrival times, namely Example 
1^ We set the tick size a = 0.0001 and the friction parameter rj = 0.15. Price and 
volatility processes are assumed to follow the same model as in the second setting. 


We simulate price processes and observation times for 10 years of 252 business 
days. We choose for Settings 2 to 4. We provide in Table a summary of 

the comparison results between HY and the bias-corrected HY. As expected from the 
theory, the RMSE is improved when using the bias-corrected estimator in Example 
In Example the bias-corrected HY doesn’t seem to perform better than HY. 
We conjecture that there is no asymptotic bias in Example and that this is the 
reason why we don’t observe any difference between the two estimators in that simple 
model. In addition, the sample bias is almost the same when using HY and the bias- 
corrected estimator for the four different settings, which is also expected from Remark 
1^ Furthermore, this sample bias tends to 0, which comes from the fact that both 
estimators are consistent. Finally, the standardized feasible statistic (50) in the hrst 
setting is reported in Tableand plotted in Figure]^ 


7 Conclusion 

We have introduced in this paper the HBT model, and we have shown that it is more 
general than some of the endogenous models of the literature. This model can be 
extended to a model including more general noise structure in observations, and even 
noise in sampling times. This is investigated in Potiron (2016). 

Under this model, we have proved the central limit theorem of the Hayashi-Yoshida 
estimator. Our main theorem states that there is an asymptotic bias. Accordingly, we 
built a bias-corrected HY estimator. We also computed the theoretical standard devi¬ 
ation, and we provided consistent estimates of it. Numerical simulations corroborate 
the theory. 

The techniques used for the proof of the main theorem could be applied to more 
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general models and to other problems such as the estimation of the integrated variance 
of noise, integrated betas, etc. In particular, independence between the efficient price 
process and the noise is not needed in the model. As long as we can approximate the 
joint distribution of the noise and the returns by a Markov chain, ideas of our proof 
can be used. 


8 Appendix 

8.1 Definition of some quantities of approximation 

We dehne in this section some quantities assuming the volatility matrix at and the grid 
function gt are constant. For that purpose, let Wt be a four dimensional Wiener process, 
c := {g~ , K, d) a four-dimensional vector such that c E C and d a constant volatility 

matrix such that the associated A““, which is the analog of dehned in Section 4.1 
when we replace at by a, is stritcly bigger than 0 and g E Q{c) a constant grid function. 
In analogy with the dehnition of the grid function gt in (A3), we assume that g can be 
written in terms of the down and up functions of both assets, i.e. g := {g^^\g^‘^^) where 
for each k = 1,2 we have := Also, we introduce Sg the subspace of 

dehned as 

Sg := {(y, n) e M X s.t. d^'^\v) <y < iP‘\v)}. 

If we set X = dW and the corresponding sampling times of both assets T := 

where for A; = 1, 2 we have ;= we dehne the observation times of the hrst 

asset as := 0 and recursively for i any positive integer 

;= inf [t > ^ [d^^\t - 

These stopping times will be seen as approximations of the observation times of the hrst 
asset when we hold the volatility matrix at and the grid gt constant. We will always 
start our approximation at a 1-correlated observation time which corresponds to 
an observation time of the hrst asset. As the sampling times of the second asset are not 
synchronized with the ones from the hrst asset, we need two more quantities {x, u) E Sg 
to approximate the observation times of the second asset. They correspond respectively 
to the increment of the second asset’s time process XI ’ since the last observation of 
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the second asset occured and the time elapsed since the last observation time of the 
second asset. We define := 0, 

;= inf > 0 : X + ^ [d 2 {t + u),U 2 {t + «)]}, 


and for any integer i >2 

:= inf {t > ^ [d 2 {t - f^l\),U 2 {t - 


Similarly, we define the analogs of (17)-(18), ( [19| ), (20), ( |2^ , (23)-(24), (25), (26) and 
( ^ respectively as f^i, f^i, ^ ^ AX^ic,-,+ nnd W by putting tildes 

on the quantities in the definitions. 


8.2 Preliminary lemmas 

Without loss of generality, we choose to work under the third scenario defined in Section 
2.4, i.e. the asset price is different from the time process for both assets. Because we 
shall prove stable convergence, and because of the local boundedness of a (because by 
(Al) a is continuous), and that infjg(o,i] A™™ > 0 we can without loss of generarality 
assume that for all t G [0,1] there exists some nonrandom constants a~ and such 
that for any eigen-value Xt of at we have 

0 < < At < cr"'", (51) 


by using a standard localization argument such that the one used in Section 2.4.5 of 
Mykland and Zhang (2012). One can further supress p as in Section 2.2 (pp. 1407-1409) 
of Mykland and Zhang (2009), and act as if X is a martingale. 

We define the subspace Ad of matrices of dimension 4x4 such that VM G Ad, for 
any eigen-value Am of M, we have 


a < Xm < cr 


(62) 


and ^ By (8) of (A2) and (51), we will assume in the following 

that Vf G [0,1], at G Ad. 

We define a^ the process (of dimension 4 x 4) on M’*' such that 


erf = at VfG[0,l], 
erf = ai Vf G [1, cx)). 
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Define now the process such that for alH > 0 

f dXf = a^dWt, 

\ A'J = A„, 

Because X^ and X have the same initial value and follow the same stochastic differential 
equation on [0,1], they are equal for all t G [0,1]. For simplicity, we keep from now on 
the notation X for X^. 


In the following, C will be dehning a constant which does not depend on i or n, 
but that can vary from a line to another. Also, we are going to use the notation as 
a subtitute of where 6 can take various names, such that (1), (2) and so on. Let 
h : N —)■ N a (not strictly) increasing non-random sequence such that 


hji —^ -|-cx), (^3) 

hnOin 0. (54) 

To keep notation as simple as possible, we dehne := := := 

:={*>! s.t. < t}, where t G [0,1]. Also, we recall 

the notation {xj:^\ := {xj:^’^\xj:^’^'^) Finally, for 9 G {(1), (2), 1C, h}, we define 

= sup Arf„. We show that these quantities tend to 0 almost surely in the following 

T® <T 

2,n 

lemma. 


Lemma 5. IFe have s® 0. 


Proof. We can follow the proof of Lemma 4.5 in Robert and Rosenbaum (2012) to prove 
that for k G {1,2}, Sn^ 0. Then, we can notice that a.s. < Sn^ + Sn^ to deduce 
that 0. To show that —)■ 0, define the process Z such that Zg = 0 and Vi > 0 

we have 


:= 


Substituting X in Lemma 4.5 of Robert and Rosenbaum’s proof by our Z, we can follow 
the same reasoning. The only main change will be that in their notation < Ch„a„, 
but this tends to 0 by (54). □ 
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Let / be a random process, s a random number, we define 


S (/, s) := sup fu- fv ■ 

0<u,v<l,\u—v\<s 

Lemma 6. Let f be a bounded random process such that for all non-random sequence 
i^ri)n>o’ Qn “t 0, thcn S {f, Qn) “t 0. Let also a random sequence such that 

P 

Sn —)■ 0. Then we have V/ > 1 that 

S if:Sn) —t 0. 

Proof. As / is bounded, convergence in P implies convergence in L* for any / > 1. 
Hence it is sufficient to show that S (/, s„) —)■ 0. Let rj > 0 and e > 0, we want to show 
that 3N > 0 such that Vn > N, we have 

{f,Sn) >r]) <e. 

3 non-random y > 0 such that P {S (/, y) > 77 ) < |. Also, 3A^ > 0 such that Vn > N, 
P(Sn > x) < §• Thus 

T (5'(/, S„) > 77 ) = P (S'(/, s„) > 77 , > x)P (S'(/, s„) > 77 , Sn < x) 

< T(sn > x)+T(*^(/,x) > h) < e. 

□ 


We aim to define the approximations of observation times on blocks 


We need some definitions first. Let (C'j*^)j>o a sequence of independent 4-dimensional 
Brownian motions (i.e. for each i, is a 4-dimensional Brownian motion), indepen¬ 
dent of everything we have defined so far. We define Vi,r7 > 0, 


sr ■■= 


AW,. , + C-f\ , 




{L,j,n) j>0-k=l,2 
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To keep symmetry in notations, we define for all integers i and n positive integers, 




( 1 ) 


i>o 


consisting of the observation times of the process 1 after substracting 


the value of i.e. ~^i\n where i* is the (random) index on the original 

grid of process 1 corresponding to = 'b%)- For process 2, we define = 0 

and for integers j > 1, where j* is the index on the original 

grid of process 2 corresponding to the smallest observation time of process 2 bigger (not 
necessarily strictly) than We also dehne rj“ „, 


-.ic ~ic- ~ic- 


, . . , . . , . . T- . T. . T- • 

2j,n’ IiJiTi "> 


’ P,j,’n following the coustructiou we used to dehne (|17|), (|18|), (|19|), (|22|), 


(23), (24) and (25). We also set 


\ ’ ’ L -i n ^ -i n .l 




Lemma 7. For 6 G {(1), (2), 1C}, any real I > 0, any positive integer i and n, any 
non-negative integer j, we have 0 < C'[ < Cp such that 


<C*al 


(65) 


where := and 


cyan < E 


Ar, 


(fc) 


^ /o+^2/ 

< Cl a^. 


(56) 


Proof. For 9 G {(1), (2)}, because of ([^ and (51), we can deduce (55) using well- 
known result on exit zone of a Brownian motion (see for instance Borodin and Salmi- 


nen (2002)). (56) can be deduced using Dubins-Schwarz theorem for continuous fo¬ 


cal martingale (see, e.g. Th. fo.1.6 in Revuz and Yor (1999)). \1 9 = 1C writing 
-e _ (;;Ld,+ ~e \ I (~d,+ ~e,+ \ working those two terms, we can 

□ 


/\f0 ^ ( fO,+ _~y 1 , , ~e,+ _ ~e,+ 


obtain (55) and (56). 


Now, we dehne for 9 G {(1), (2), 1C, h} the number of observation times before t as 

Kn = < t}. 

We have the following lemma 

Lemma 8. For 9 G {(1), (2), IC*}, we have that the sequence is tight. 
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Proof. Here for 6 G {(1), (2)} we can follow the proof of Lemma 4.6 in Robert and 
Rosenbaum (2012) together with Lemma 5, Also, by dehnition we have so 

we also deduce the tightness of ^ ^. □ 


' n>l 


Lemma 9. Let o,n array of positive random variables and 6 G {(1), (2), 1C}. 

If 


Vn > 0, E-:T" ^ 4 0 (57) 

then Y^k Ur,n 4 0. Also, tf'iu > 0, 4 0 ^ 4 0. 

Proof. Let e > 0 and u > 0. 




Lit Q: 71 


N? 

^^t.n 




<KJ 


2 = 1 


2=1 
^ -2 


-2 I 1 

2=L2Xan J + 1 


X] Ui,nl{^uap^>NlJ > ^ 


i=NL+i 


LUan 


N? 


\ i=Lna^^j+l y 

Lua-^. \ / <n 

< P I ^ Ui^n ^ 2 I ^ 1 ^ > 2 

\ / \2=L22an^J + l 

/L UQn^ J \ 

— ^ 1 ^ 9 ) ^ -I < ■ 


2=1 


We take the limsup and use (|57|). We obtain 

n^oo 


'N' 


t,n 


limsupP j e 1 < limsup P (lmct^^j < N^n) 


2 = 1 


We now tend u ^ oo and conclude using Lemma The second statement is proved in 
the same way. □ 
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Lemma 10. For any a > 0, a E Ai,g € Q, {x,u) G Sg, we have that 



{a,g,x,u) 

= (cr, ag, ax, a^u) 


{a,g,x,u) 

= (a, ag, ax, a^u 

^AC2 

{a,g,x,u) 

= {a, ag, ax, a^u 


{a,g,x,u) 

= a~‘^'ilj'^ {a, ag, ax, a^v) . 


Proof. For any Brownian motion iWt)^yQ, by the scale property we have that = 

{a~^Wa■ 2 t)^yQ■ Thus, if we define r = inf{f > 0 s.t. Wt ^ [d{t), u{t)]} and Tq = inf{f > 
0 s.t. Wt ^ [ad{t), au{t)]}, we have that 

T = inf{f > 0 s.t. Wa' 2 t ^ [ad{t), au{t)]} = a~‘^Ta. 


We deduce that 


O.Wr) = 


(58) 


We can prove the lemma based on the way we proved (58), at the cost of 2-dimension 


dehnitions that would be more involved and straightforward applications of Strong 
Markov property of Brownian motions that we won’t write, so that we don’t lose our¬ 
selves in the technicality of this proof. □ 


We introduce the number of points in the ith block in the fcth process as the following 

= max{j > 0 s.t. -f 

We also introduce the total number of points in the zth block iVj,n = 

We show now that we can control uniformly the error of the approximations of the 
observation times. 


Lemma 11. Let I > 1, we have that 


sup E 

i>0 , 2<j</iyi 




= Op {an) 


and 


sup E 

i>0 , 2<j<hn 


^ IjyTi ^ I 


= Op (“?) ■ 


(59) 


(60) 
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Proof. We introduce the notation Op where U stands for “uniformly in z > 0”, meaning 
that the sup of the rests is of the given order 

First step : We define = supf/^^ We show in this step that 


s: 4 0. 


( 61 ) 


We define the accumulated time of approximated durations, i.e. 


= 

' i.n 


l=i 

1=0 


1C 

hn.n- 


Using Lemma together with Lemma 3M > 0 such that 




We define Z" = 0 and Vf e [fti,n: 


ryn _ ryn I 02—l,n 

, + ^t-r^ 

i —l,n ^ — 


A slight modihcation of the proof of Lemma will conclude. 

Second step : We show that we can do a localization in the number of observations 
in the i-th block, i.e. there exists a non-random such that 


p(max(Af<:.',A6'J>) >M„) 


( 62 ) 


converges uniformly (in i) towards 0 and increasing at most linearly with i.e. 
we have ^ l3hn where /? > 0. 


To prove (62), we need some dehnitions. Dehne for z > 0 the order of observation 


times Oi^k,n and the order of the approximated observation times Oi^k,n in the following 
way. Let := (Lp>)j>o sorted set of all observation times (corresponding to 
process 1 and 2) strictly greater than Then for j > 1, we will set = 1 if 

the j-th observation time in corresponds to an observation of the hrst process and 
Oij^n = 2 if it corresponds to an observation of the second process. Similarly, we set 
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the sorted set of all approximated times 


same way. There exists a p > 0 such that for all integers i,j, n : 


1 ■ Onn are dehned in the 


IP I Oijn,n — 1 


Oij,„ = 2^ > p and P (Oij+i,„ = 2 Oij,„ = l) > p. 


(63) 


Indeed, let I the (random) index such that Conditionally on Oi^j^n = 1 


we know that Oijn,n = 2 if AX,^, 


+r. - 

i,n ' 1,3,n 


crosses or —before o 


crosses g or —g . Using of (A2) and (51), we can easily bound away from 0 this 


i,n ' 


probability, thus we deduce (63). Now, using (22) together with (63) and strong Markov 


property of Brownian motions, we deduce (62). 


Third step ; let p = {d,u) such that {g,g) eQ, a E and e < ^. We dehne 

T{g,a,e) = inf{t > 0 : aWt = u(t) + e or aWt = d{t) — e}, where Wt is a standard 
Brownian motion. We show that 

E r (p, a, e) - r (p, a, 0) < (64) 

where 76 ) (e) 0 . 


In order to show (64), let 


O (p, a, e) = inf{t > 0 : aWt+r{g,a,o) = min {u{T{g, a, 0)) + Kt + e, g^) 

or aWt+r{g,a,o) = max {d{T{g, a,0))- Kt- e, g~)}. 

By (g and ,0 of (A3), we have r {g, a,e) — t {g, a, 0 ) < O {g, a, e). Conditionally on 
T{g,a,e) C using strong Markov property of Brownian motions, we can show that 


'^r{g,o-,e) 

instance. 


O {g, a, e) 0 using Theorem 2 in Potzelberger and Wang (2001) for 


Fourth step ; let k E {1,2}. We show here that 




7 


(k) 


7 


(fc) 

i,j,n 


The idea is to show that by recurrence in j, E 


= K) • 


AU _Afc) 


when n grows. It is then a straightforward analysis exercise 


(65) 

can be arbitrarily small 
o use the localization in 


35 


























second step and choose a different sequence h if necessary, that will still be non-random 


increasing and following (53) and (54), so that the sum in (65) will be also arbitrarily 


small. Let’s start with j = 1 and k = 1. 


E 


Ak) _Ak) 

2 ,l,n 2 ,l,n 


= E 


Ak) _~{k) 


E 


Ak) _Ak) 

i.l.n 2,l,n 


LeP 

i,n 


where Ei^n = E^^^ fl E^'^^ with 


= 

i.n 


sup 




hi.d 1 ^ h 


E^^^ = 

I.n 


sup 

- ^6hi!n-'^L+'ri,V,nV4Vnl 


- 0^'^ II 


< Vl,n >, 


Vi,n = qndn, Qn = max (at Zn‘^') and Zn = sup (e [S (cr“’’’, s’^ V s^)Y ) . By 

V / 1 < 7 /,. 7!<4 V L J/ 


(58) and (64), 


E 


(fc) _~(fc) 

' i,l,n ' i,l,n 


< Caf {2qn) + (-2gn)) • 


Using Cauchy-Schwarz inequality and Lemma 


E 


ik) _ {k) 




< CaJP < Co?/ (P ((4‘„')^) + P {{k^y)) 


1/2 


On the one hand. 


p 1 14 / 


c 


< {rji^n) ^ E 


sup 


*e[r^ ,A„+UV vfW 






A ,4 

i.n’ J 


< C (rji n) ^ max E 


"U,n^U,l,n^U,l,n 


1 /2- 


ay - ay ) ds 


A C ( 71 ,n) ^ max E 


(T 7 „vf' 7 )sp“'”, 4 vsa")‘''' 


< Czl/\ 


JI) Vfp> 

i,l,n ^ '2,1,n 


1/2 
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where we used Markov inequality in the hrst inequality, conditional Burkholder-Davis- 
Gundy inequality in the second inequality, Cauchy-Schwarz inequality in the fourth 
inequality. Lemma in the last inequality. On the other hand. 


P 



< ( 771 ,„) ^ E 


sup 




< C ( 771 ,„) ^ E 


yfW 

2,l,n ^ 2,l,n 


where we used Markov inequality in the hrst inequality, ( 12 ) of (A3) in the second 
inequality. Lemma in the last inequality. In summary, we have 

E (2gn) + 7^'^ (-2gn) + 4^' + cr'"-'/') . 




f 

‘ij,n 

i,j,n 



which we can make arbitrarily small, because —)■ 0 by hrst step together with Lemma 
[^and the continuity of a (Al). The case with fc = 2 is very similar. Finally, for J > 1, 
the same kind of computation techniques, using in addition ([II| of (A3), will work. 


Fifth step ; Prove that uniformly (in i) 


IP ( Vj < Mn, Oij^n — 1. 


( 66 ) 


O _ 


To show ( 66 ), let j < M„. We dehne the (random) index v such that = r, 
Modifying suitably h if needed, there exists (using fourth step) a sequence (e„) such 
that 


P 


Ak) _Ak) 


p 


r® - 

2,t;+l,n i^v,n 


< 




I 5 

^ 0 . 


Using (67) and ( 68 ), we can verify ( 66 ) by recurrence. 


Sixth step : We prove here (59) and (60). Using Lemma and ( 66 ) we obtain 


E 


'1C 




= E 


Z.1C 


ArU - Ar. 

i,j,n ' i,j,n 




+ o? • 


(67) 

( 68 ) 
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The first term on the right part of the inequality can be bounded by 


C E 


^ic ~ic 

T- ■ — T- • 

' 2 


{Vi<M„ ,Oj =dj } 


+E 


-1C 


'1C 


q- _ q- 

'zj —l,n — 




Both terms can be treated with the same trick. Using the second step and Lemma 
the hrst term is equal to 


E E 


V<Mn 


^IC ~1C 

' i,j,n ' z,j,n 


t i,j,n i,v,ni 


+ K) • 


The sum is obviously bounded by 

Ee 


V<Mn 


^IC -IC 
T- ■ — T- • 

z,j,n z^j^n 


and using (65), we prove ( |59[ ). We can deduce (60) with the same kind of computations. 

□ 


Let M” the interpolated normalized error, i.e. 


= a 


-1 

n 




corresponds exactly to the normalized error of the Hayashi-Yoshida estimator if we 
observe the price of both assets at time t. We recall the definition of 


W,n = 

^ i,n 


i.n 


rlC 




Lemma 12. fUe have 


(am 


n 

.h 

i.n 


= a. 


-2 


N Eh 

Z—/ U-l,; 


Z^An 


ZG-Ati 

hfl 


u=2 


+ Op(l)- 
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Proof. We obtain this equality noting that {Ni^n)n>o centered and 1-correlated, and 
that the terms left converge to 0 in probability. □ 

We introduce the observation time at the start of a block, where “s” stands for “start” 


= sup{rj;„ s.t. 


Lemma 13. We have 
\ E /i 

Z—/ U-i.i 


a 


-2 


ieA. 

h„-2 




u=2 


= 


,a^^x,a^‘^v) dni_ij^n{x,v) + Op{l). 
^ ^ 


i&An j=0 


Proof. First step : approximating with holding volatility constant. Set 


Nin= {o't^ AUhlC^ (arO AW 1C-A 


( 2 ) 


r.lC 

i,n 


Ct ds 




where is the i-th component of the vector A. We want to show that : 




, -2 


Z—/ U-i.i 




^ ^ {jd{i—i)hn+v^ A 


u=2 


= a 


-2 


V E^h 
Z—/ w-1,' 


i<^An 


•^n 2 

'y ^ {i—l'^hn+U^{i—l)h„+u+l 

u=2 


+ Op (1) . 


Noting Fi^n = (A^i,n) + 2W,nA^i+i,n and = ( W,n ) + 2W,nA^i+i,n, it is sufficient to 
show that 


a. 


^Et- 


-2 

/ -J — 

i>l 


F - F 

i.n i.n 




X r\ 

^0, 


(1) 


that we can rewrite as ^ 

it is sufficient to show that Vu > 0 ; 


I.n I.n 


l{^Al,n<d 


—)• 0. Using Lemma 


a. 


-2 




2=1 


F - F 

2,72 2 ,n 




A 0. 
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Thus, it is sufficient to show the convergence of this quantity, i.e. that 


a. 


-2 


2=1 


F - F 

2 ,n 2 ,n 




^ 0 . 


We have that 


F — F 

^ i^n ^ i,n 


< + 2b£’, 


where B^l = 


-N^ 

i,n i,n 


and 5(^2 = 


N- -I N- - iV- 1 N- 


We have that 


where 


f)W ^ ^(1) ^(2) I ^(3) 
-^i,n — ^i,n ^i,n ' ^i,n ’ 


^(1) _ 

^i,n 


^(2) _ 


ax^]Iax^^^ 


'^i,n 


(Tt-s AhWic 1 

i — l,n ‘i,n J 


x(i) 


AC 


AC 






Ct ds 


(Tr® AW 1C.- 


( 2 ) 


cS = 2 




t,n 


i,n 


ic,-,+ / 

*■" Jt^C 

i — l,n 


Crds 


. , d) / ^ \ (2) 

— I (J-r® AWLlC I ( (Tt-s aw IC.-.-f 

' ' V i-i." n.„ 


rlC- 

r,n 


,1C 


CJ, ds 


Let’s show that E 

where 


^(1) 

n-i.n 




—)■ 0. We can write it as 



40 



We want to show that ® 




i,n^{Tf_i <t} 


—)■ 0. We dehne ; 




.lC',-,+ ? 

i,n 


eS = 1'^- 


AM/ 


__ ^IC 

^-l,n 'in 


( 1 ) 


AX 


( 2 ) 

ic,-,+ - 

i.n 


Using Canchy-Schwarz ineqnality, we dednce : 


E 




<t} 


= E 
< ( E 


+ 45 ) (45 - 45 ) 


p(l) _i_ p(2) 

W,n + W,n 


E 


p(i)_ 1 

i,n i,n J 


1/2 


Using Canchy-Schwarz ineqnality together with Bnrkholder-Davis-Gnndy ineqnality 
and Lemma we obtain that : 


E 


r-'(l) _i_ p(2) 
^i,n + ^i,n 


= K) 


where U stands for “nniformly in 1 < i < Another application of Canchy- 

Schwarz ineqnality gives ns 


E 


pi(i) pU) \ 1 

i,n i,n ) lU-i „<d 


< E 




(Jr‘> AhWlC 

t — l,n 'i^n 


( 1 ) 


4t'> <t} 

t I.n J 


E 


AX 


( 2 ) 


i.n 


1/2 


Using once again Canchy-Schwarz ineqnality together with Bnrholder-Davis-Gnndy in- 
eqnality and Lemma we obtain that ; 


E 


AX 


( 2 ) 


= K) 


Similarly, we compnte nsing conditional Bnrkholder-Davis-Gnndy in hrst ineqnality, 
Canchy-Schwarz in third ineqnality. Lemma Lemma and Lemma together with 
the continnity of a (Al) in last eqnality. 


E 


AX^l^ - ( ars AlU^ic 


( 1 ) 


4Wi,Wd 


41 

























< 


C sup E 


U-l.n 




— a 


hi 

''"/-l.n 


2 



2 


= C sup E 






— a 


hi \ 


2 



2 


< C sup E 

i<i,^<4 


+0^(«n) 


< C E 




E 


sup 

.i<i,^<4 


1/2 


+ («^) 


= o^K)- 


— z 

With the same kind of computations, we show that Yll=i ® 




and we also can show ^ Yl^=i ® 

0 (thus we have also that o;”^ '^i=i E 

-2 

uan 


^i,n -*-{'r?_i_„<t} 


dW 1 

i,n 


^ 0, a-2 Er=f E 

—)■ 0) and 


i,n -'-{T|'-l,n<*} 

( 3 )- 


—)■ 0 , 




-)■ 


-2 


i=l 




t,n „<d 


^ 0 . 


Second step ; approximating using ('rij,n)j j„>o instead of ('ri,n)j„>o- We set 


~ f 

\ ( 1 ) / \ ( 2 ) 

Nij^n= [(Trh AWfic ) (ah AW-ic,-,+ ] - 

\ / I ~1C w.n 

«/ 7”- • 1 

—l,n 


ci^ ds. 


We want to show that 

E h 

Z—/ U-l,n 


a 


-2 


ieA„ 


u=2 
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= a. 


-2 




.h 

i—l,n 




E(^. 


u=2 


i—l,u,n ) “1“ 


+ Op ( 1 ) 


Using the same kind of computations as in the first step together with Lemma [m we 
conclude. 

Third step : express the result as a function of . Using Lemma IT in last 
equality, we deduce for any integer u such that 2 < u < hn that 

I —l,Tl \ / 

= / ) d^i,u-2,n {x, v) 

= «n / ,a-^X,a-'^v)d^i^u-2,n{x,v). 

/Tn>2 V ^ ^ / 


□ 


Lemma 14. Vcr G Ad-iQ G Q,^TT{(T,g) distribution such that : 


hn-2 


ILn ^ n 

’fi'r'Li ,an^X,a~^u] d7ri_ij^n{x,u) 
isA. i=0 V * / 

= alJ2 

i^A-n 

Proof. We define the transition functions of the Markov chains ( Zi (a, g )) defined 

V / i>o 


in (33). For (x,m) G iSg, B ^ B (Sg) (borelians of Sg) 


p A, g) ((a^, u) ,B) = P [Zi (a, g) e B 


^0 A, g) = {x, u) . 


First step : We prove that Va G Al, 'ig G Q, the state space Sg is i/-small, i.e. 
there exists a non-trivial measure z/ on i3(]R^) such that \/{x,u) G Sg,\/B G B{Sg), 
P {a, g) {{x,u), B) > ^'{B). Let B = [xa,Xb\ x [ua,Ub\. We are choosing v such that 
1 ^ = 0 outside [—^, ^] x [3,4]. Thus, without loss of generality, we have that [xa,Xb] x 
[ua,Ub\ C X [3,4]. We want to show that 3c > 0 such that uniformly 


P (cr, g) ((x, u) ,B)>c {xb - xj (ub - uj ■ 
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There are two useful ways to rewrite The hrst one is ; 

:= (69) 

Xf) ;= ^ _ ^^3,4^2y/2^(4)^3,±_ 

where and are independent, G [p!i’^,p+^] and max (—pi^^) < 1 (be¬ 
cause a G A4 ), 

5 = (^1 - max ((pi’^)^ , (p+^)^)) ' ■ (71) 

The other way to rewrite it is ; 

y'‘> := a'-'p*’, (72) 

Xf> := /•‘cr<®>p‘‘*+(l-(p''‘)^)‘'’T®B'’-^. (73) 


where B^^'^ and B^'^ are independent. For (i?t)^>Q a standard Brownian motion, 
a < X < b, we denote the exiting-zone time of the Brownian motion 

r“’^ = inf{f > 0 s.t. x + Bt = a or x + B^ = b} 

and pi{x,a,b,t) the density of r“’^. We also dehne p 2 {x,a,b, s,y) the distribution of 
Bg + X conditioned on {r“’^ > s}. Finally, let p 3 {x,a,b,t) the distribution of 
conditioned on {B^a,b = b}. All the formulas can be found in Borodin and Salminen 
(2002). Consider the spaces Ci = C 3 = {{x,a,b,t) G s.t. a < x < b,t > 0}, 
C 2 = {{x,a,b,t,y) G s.t. a < x < b , a < y < b , t > 0}. The functions pi are 
continuous on Ci and positive. Thus, for all compact set Ki G Ci, we have 

inf Piik) > 0. (74) 

k&Ki 

We can bound below 

P{a,g){{x,u),B) > P fj fl ^2 fj ^3 fj ^4 ^0 = (a:,«)) , 
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where 
Eq = 

El = 
E 2 = 
E 3 = 


sup 


,(2) 


sup 


xp) 
xP^ 


< .g-1) < j,., 




< 


15(T+ 

ea” 


sup 


sup 


P2) 

'2 

.( 2 ) 




10cr+ ’ 
~-( 2 ) 


AB 


3,-L 


< 


9 


4(a+) 


2 I ’ 


<5 ,^2 e[x + 2 ,x + 


Vs e [rf ,X + 4] xf) G [diiK),uiiK)] , xjj^^ G [ui(X) - 2e,ui(X) - e] 


n{ 


sup 

f2 '’<s<K+A 


[4\s] 


Ea = \ e [Ua + f 2 ^"^ Ufe + , inf 


.( 2 ) 


<12^’ 


(2)l 


'(3) 


> - 2 e 


A'+4<s<f- 


( 1 ) 


n{ 


sup 

K+A<s<f4'’ 


{4\s\ 


44 ) 


< g , AX (-2) -{!), ^ [^ai 

U2 >u I 


where e = Using extensively Bayes formula, we can rewrite 

F (eo[]Ei[]E2[]E3[]E,[]{Zi G B} 


Zo = {x,u)] = IxIIx III X IV xV, 


where / = P ( U, 


P U 2 


{Zo = {x,u)}], II = f[Ei 


Eiy^{Zii = (Xju)} ), and also III = 


P ^4 


El n Eo n {^0 = (x, u)} , JU = P U 3 


U 2 n -Si n Eq n {^0 = {X ,«)} ) and V = 


u3n^2n^in^on{^o = (a:,u)} . 


We prove that / is uniformly bounded away from 0. Using (52), (69), (70) and (71), 
we deduce that Eq^^ Eq^^ C Eq where 


= { sup 

'-0<s<iC 




< 


ea min((T , 1 ) 


15 (cT+)" 


£;(2) _ 
U/Q — 


sup 

0<s<K (j 


X 


(4) 


(1 - (p3,4)2) 


2x1/2 


+ -BP 


^ ^ ecT min(a , 1 ) 


5a~ 


15 (o-+)" 


Conditionally on {Zq = (a:,^)}, E^^ and E\4’ are independent. Thus, we deduce 


( 2 ) 


/ > P u 


41) 


{Zo = ix,u)}]F{E, 


42) 

^0 


{Zo = ( x , m )} . 
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Using Markov property of Brownian motions, we obtain that the right part of the 
inequality is equal to 


-if 


Pi 0, 


ecr min(cT , 1) ecr min((T , 1) 


15 (cT+)' 


15 (cr+)" 


, dt^ j Pi j dt, 


where = 




’ ^ ’ which is uniformly (in x, cr and 


g) bounded away from 0 using (52) and (74). 

We prove that II is uniformly bounded away from 0. Conditionally on i?o = 

(x,-u)}, the two quantities of Ei are independent. Thus, we bound below II (the same 
way we did for I) by 


r-if+l 


U(2) 


Pi B 


j( 3 ) eg- _ eg- 

■rP'’ 10(T+cr*^^h lOcr+a^^h 


,t]dt 


P 2 ) (°’ 4cr+aU)’4cr+aU)’^ ) 1 ’ 


which is uniformly bounded away from 0 using (52) together with (74). 


We prove that III is uniformly bounded away from 0. Using (52), (69), (70) and 
(71), we deduce that -^2 where 


^( 1 ) _ 
^2 — 


^( 2 ) _ 
^2 — 


sup 

K+l<s<K+3 


sup 

K+l<s<K+2 


5 ( 3 ) 


< 


AB' 


5(7+ i ’ 
9~ 




< 


K+2<s<K+3 


AB 




Q ^ 

> ^ + 


6a~ 


5a+5 


Conditionally on i?i f] i?o = (^^w)}, E^'^ and are independent. Thus, we 

deduce 


n(2) 


JJJ > P E. 


41 ) 


Ei[]Eo[]{Zo = {x,u)})f Ei[]Eo[]{Zo = (a:,^)} ) . 


Using Markov property of Brownian motions, we obtain that the right part of the 


inequality conditioned on Ei Cli^o = (^^w)}} is equal to 

hi >.^+ 1 ] 


Pi [B 


>(3)_^_ 

5a+’5cr+^ 


,t) dt 


I A 53 ,± 9 9 

Pi ( A5[.(2 )^^^i], 2a+'2a+'‘ 


,t] dt 
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^ I'’ll 


2cr+ 


^ + ^^,t] dtdq{y), 


where q is the (conditional) distribution of 


[fr‘,K+i] 


+ Bi conditioned on 






Using the dehnition of Ei together with (52) and (74), we have III which is uniformly 
bounded away from 0. 


We prove that IV is uniformly bounded away from 0. Using (72) and (73), we 
deduce that e'^'^ f) Ef^ C E^ where 


Ep = 


.(2)< 


sup 


AS 


(4) 




< 


ecr 




sf = ^ V. G K + A] e [yl\yr] , e [yl\yt 


-.m 


,(b 


with 7 /^^^ = MK)+2e/^ (2) ^ ^lW-2U5 (3) ^ u^{K)-%e/5 ,, 

a(4)(l-(p3,4)2^1/2’ ^3 cr(4)(l-(p3.4)2^1/2> % (rW (l-(p3,4)2) ’ 

^ • Conditionally on Ssn^ifl^oni^o = {x,u)], and 


are independent. Thus, we deduce 


JU > P S. 


41) 


S2flSif|Sofl{Zo = (x,n)} 


P S. 


42) 


E2[]E,[]E,[^{Zo = M}) . 


Using Markov property of Brownian motions, we obtain that the right part of the 


inequality conditioned on {f. 


( 2 ) 


S 2 n n ^0 ni^o = (x ,«)}} is equal to 


rK+4-f. 


( 2 ) 


1 - 




K-l-4—Tr 


(2) N 

Pi ( 0 ,l/f \ t) dt 


I'vl 


/ (3) 
'Va 


P 2 ( 0 , yi^ly^sl K + A- ff \ dy, 
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which is uniformly bounded away from 0 using (52), (71) and (74). 


We prove that V > c(xb — Xa){ub — Ua). Using (69) and (70), we deduce that 
Uf ^ n C Ei where 


uf^ e[Ua + T^^’,Ub + T^^’] , = Ui 


( 2 ) 


;(2)l vi^) _ 


uf) = 


sup 

K-\-4:<s<r 


AB 


[ir+4,s] 


< 


5g- 


Tj2 ’ ^^[1+4,f] e 


(1) .,(2)l 


(l — 

f = inf{f > X + 4 : Xf) = u,{K) or = -2e}, 


Xa - AX% 

(1) _ [ff\K+4] 

Ui — 




a 


(4) 


(1 - (p3,4)2) 


2x1/2 


and yf^ = 


- '4 ’ , -U/A- —■ We have 

^(4)(l_(p3.4)2)l/2 

U = P = Ul(X) 


xp ) fi uf) Us fi ^2 n n ^0 = MK)}j . 

The first term on the right part of the equation is uniformly bounded away from 0 
(Borodin and Salminen (2002)). Because f is a function of X^^) ^nd is independent 
with f and are independent. Thus the second term on the right conditioned 
on 

{y^^\yf\xP_^„ ) Us fl U 2 fl Us f| Uq []{Zo = {x, u)}} 


can be expressed as : 


ru,+f^^>-(K+4) (x^^\^ ^x|4-2e Ui{K) 


' Ua+f^^X(^K+4) Jyl 


( 1 ) 


P3 


cr(3) ’ (j(3) 

1/2 


cr 


(3) 


U P2 0, 


bg bg 


(3) 

Vl Vi 


^s) i^iV j dtdy , 


where y^^'^ = — (p^’"^)^j • We have that y^'^ and y^'^ are dominated by 


3g 


4o-(4)(l-(p3,4)2y 

c{Xb - Xa){Ub - Ua). 


Yj 2 . Using this together with (52), (71) and (74), we deduce that V > 
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Second step ; We prove that 

^AV 

:= sup 

(a,g,x,M) 



°° creM,g£g,(x,u)eSg 



< cx). To 


show this, we bound the term as 


E 




jilC^ 

^2 


2 


C^’^Ah 


1C 


< 2E 




^2 


2 


+ (Ar^^^ 


The second term in the right hand-side of the inequality is uniformly bounded using 
(52) and Lemma Using successively Cauchy-Schwarz and Burholder-Davis-Gundy 
inequality, (52) and Lemma we can also bound uniformly the first term. The other 
term of (29) can be bounded in the same way. 

Third step : Dehne q = (a, g, x, u) and 

Q = {{a,g,x,u) s.t. a G M.,g G Q, {x,u) G 5^}. 

Prove that Vg G Q, there exists a measure tt (ct, g) such that 


sup 

geS 


n—1 „ 

E/ 

1=0 


a, g, y, v) d^i (a, g, x, u) {y, v) - n I (a, g, y, v) dir (a, g) {y, v) 


= nOp{l). 

To show this, we use first step together with Th.16.0.2 [v) (Meyn and Tweedie (2009)). 
We obtain that there exists tt {a, g) where 




TT (c, g) 


< 2r" 
TV 


and r = 1 — z/ (M^). Thus, we deduce : 



{a,g,y,v)dni {a,g,x,u) {y,v) 


(o', S', I/, v) dn (a, g) (y, v) 


< 




AV 


7ii (a, g, x,u)-7i (a, g) 


< 2 


TV 




AV 


(75) 


We want to show that Ve > 0, 3A > 0 such that Vn > N : 


n—1 „ 

E / 

1=0 


(a, g, g, v) dTii (a, g, x, n) (i/, v) - n I (a, g, g, n) dn (a, g) (g, n) 
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< en. 


( 76 ) 


The rest is a straightforward analysis exercise. Let e > 0. 3iVi > 0 such that 
Choosing N > we first use the triangular inequality, and 


rNx ^ I 


then split the sum of the left part of (76) in two parts, one up to Ni and the other one 


up to N. We use (75) in the second part to obtain (76). 


Fourth step : Proving the Lemma. Let tc > 0. From Lemma we just have to 
show that 


L.wa„'^h{n) hn-2 


Y {V, v) 

i=i j=o 

(a^h ,g^h ) 

\ i — l,n i — l,n J 

tends to 0 in probability. Using third step together with standard results on regular 
conditional distributions (see for instance Section 4.3 (pp. 77 — 80) in Breiman (1992)), 
we prove the lemma. 

□ 


Lemma 15. We have 


a. 


'y E /i 

w-1,) 


i^-An 




dJhJ Hn (E.t. Wtn]) 


-1 


7 

ieAn 


kAV 


•> Qt^ ^ 

l.n l.n 


AU 


-1 


+ Op(l). 


Proof. First step ; Defining 


u. 


i^n 




Aq : 

Til := E 


i—l,n 


‘ i—l,n 




-1 




-1 


we have that Aq = Ai + Op (1). To show this, in light of Lemma 11, we have that 


Eh 




Hi 
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where Cn tends to 0 in probability. From this, we can easily show that Ao = Ai + Op (1). 


Second step : We have that 






-1 


+ Op(l)- 


To prove it, we can mimic the proof of Lemma 14, together with Lemma 11 


□ 


8.3 Computation of the limits of and (M”, 


i&An 
. -2 


(am; 


2 p 

EE/ 


+ Op(l) 


= 




= 


u=2 

AV 


+ Op(l) 


ieA„ j=0 


r _ ’ 9t!^ , _ ) 


' i — l,n ' i — l,n 


X, dTli-ij^n {x, u) + Op(l), 


where we used Lemma 2.2.11 of Jacod and Protter (2012) in first equality. Lemma 12 


in second equality. Lemma 13 in third equality. 


We deduce (using Lemma in first equality and Lemma 15 in third equality) 

{M"), = alThvH', + 0 ,( 1 ) 

f ^ i — l,n 


i^A-n 


= a 


E® 

i^An 

E® 


-h 

i — l.n 


(E,,., [Ar^] 

i — l,n ’ ' 7,— I >- ’ 


-1 


Op(l) 


.h 

i — l,n 


ieAr, 


AV 


i^n \vr‘ 


-1 


+ Op(l). 


Using Lemma 2.2.11 of Jacod and Protter (2012) again, we deduce 


(Af’>>, = A Hr. Hr ) + 0,(1)- 

f ^ i — l,n ’ \ / 

i^An 
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Using Lemma [^together with Prop. 1.4.44 (page 51) in Jacod and Shiryaev (2003), 
we obtain 

(4,1)-'ds. (77) 

Using the same approximations and computations, we also compute 

(Ar,A'<‘>), ^ f4f\4l)-'ds, (78) 

Jo 

(Ar.A'Py ^ [‘4f^4l)-'ds. (79) 

Jo 

8.4 Computation of the asymptotic bias and variance 

We follow the idea in 1-dimension in pp. 155-156 of Mykland and Zhang (2012), and 
define an auxiliary martingale 


A7,“ = A/,” - f‘ k^dXl'i - f 
Jo Jo 


where Xl’~^ is defined in ( |39[ ). Using ( |78[ ), we deduce 

rt 




'0 


-)■ 


(01) ^ds- /cp) (af ds. 


Hence, we choose 




By the same techniques that we used to compute (78), we have that 


'0 


10 


(80) 
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Using (79) and (80) we compute 




p 

-)■ 


Jo 

Jo Jo 

(M^X(2))-(M^ f f kyd{X^’^)s 

Jo Jo 



( 01 ) 


Hence, we choose 






By (H4), there exists S' > 0 such that the S Brownian motions {D^^\ ..., generate 
the filtration {Xt)^^Q- To show that (M"’, tends to 0 in probability, we decompose 

/)(*) = _D®’i + Zi )*>2 ]js,i bgiongs to the space spanned by is 

orthogonal to this space. By what precedes, we have clearly {M'^, tends to 0 in 

probability. Also, is a martingale that is, conditionally on the observations times 
of both processes, independent of M"". Thus we also deduce that (M”, converges 

to 0 in probability. 

We can now compute 


{M^)t = (M" 


ki^UxP 


kj dX^' )t 

-t 


{An.+ / W‘TW‘T<i.+ / pf)Vi-(pP)AA,p)^<;s 


-2 / / k\-^d{X'’^,M’'), 


-)■ 


2 


+ 2 


(ai'))"' - (ki + kix) («-' 


+ (kW? + (-f)' (l - (PJS') 
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By letting 

AV. = (.#.y +2 + (.#.;)-■ 

we deduce using Theorem 2.28 in Mykland and Zhang (2012) that stably in law as 

0!n 0, 


a. 


-MRCVt,n- RCVt) ^ I ki^Uxi^^+ I kydxy+ I {AVsf^dWs. 


We have just shown Theorem 1, Now, we express the asymptotic bias AB^ = f* ks^'^dXg^^-h 
f* kl’-‘-dX^’-‘- differently as 


AB, = 


/ / ky(i-(pi’^)yRa(^^dBy 

Jo Jo 

Jo Jo Jo 

+ [kp {i-{pPf)"'\f'dwy 

-kPpPaP fkPdXP. 


We thus deduce the expression of AB'i^^ and ABf‘\ 


The proof of Corollary follows in the same way as the proof of Theorem We 

hold constant the asymptotic variance and the asymptotic bias on blocks of size hn- 

--(1) --(2) — 

Moreover, we can see that AB^^^, ^Bia AVi,a are uniformly consistent estimators 
under the constant model. 


8.5 Discussion on the adaptation of Theorem proofs for more 
general models 


We discuss in this section how to adapt the proofs of Theorem when considering 
Example]^ up to ExampleIn that case, the HBT can be defined for each fc = 1, 2 as 
To,n ■= 0 and recursively as 


:= inf 

' i.n 


t > 


(k) 


i—l,n 


: AX 


{t,k) 


i 


aX) (a 
^^t,n V'' 


{t - } (81) 
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for any positive integer i. In (81), the grid := depends on n, thus 

the term in the asymptotic variance obtained in Theorem will have a different 
interpretation. Indeed, gl^^ will be seen as a (possibly multidimensional) continuous 


time-varying parameter which generates (81) instead of the scaled grid function itself. 


In particular, the approximations will not be carried with holding gt ^ constant on each 
block, but rather with holding gt constant on each block. Also, for any hxed t G [0,1], 

ik) _L 

gi will not be a function on M+, but a simple vector. The reader can refer to Potiron 
(2016) for the notion of time-varying parameter. Note that Assumption (A3) is only 


used in Lemma IT and Lemma Thus, Lemma IT and Lemma M are the only parts 
in the proof which need to be adapted. 


8.5.1 Example]^ (hitting constant boundaries of the jump size) 

For each asset A; = 1, 2 we dehne the jump sizes as We assume that and are 


independent of each other. We have that g^^^ i-^) ■= (—for t G ]• 

We also have a non-time varying parameter gt := 1. 

As the jump size are IID and independent of the other quantities, we can 
consider the same when making local approximations. Note that in Lemma [h 
the proof is made recursively for each observation time of the block. Thus a "jump" o: 
gt^n is not a problem when it happens exactly at observation times, as long as the same 
jump is also made in the approximation block. Since is assumed to be bounded, it 


qfc) 


-ik) 


-(fc) 


is straightforward to adapt the proof of Lemma 11 


We discuss now how to adapt the proof of Lemma 14 To do that, we consider 

l[}\ where i' is such that 




1C 


the Markov chain Zt := 

j' is such that and are IID sequences independent of 


each other which follows respectively the distribution of and Lfl- Then, everything 


( 2 ) 


follows the same way as in the proof of Lemma 14 


8.5.2 Example 1^ (model with uncertainty zones) 

This model is very similar to Example except that the sequence is obtained as a 
function of where corresponds to the continuous time-varying parameter Xt 
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of the /cth asset introduced in p. 5 of Robert and Rosenbaum (2012). We thus consider 
:= The proof of Lemmacan be extended using the convenient construction 
of provided in p. 11 of Robert and Rosenbaum (2012). We extend this construction 
in two-dimension assuming that (hL/)T^ and (W/)^^^ independent. As Example]^ 
is slightly more involved than Example the Markov chain Zi needs to include also 
the type of previous price change (increment or decrement) for each asset. We thus 
consider Zi := (AX 


( 2 ) 

nc,- -ICO 


r. 


1C 


L(./\sign(AX)Ji^)),sign(AX)(j))), and 


( 1 ) 


K2) 


can follow the same line of reasoning as in Lemma 14 


8.5.3 Example (times generated by hitting an irregular grid model) 

In this case, the parameter := 1 is non time-varying. Lemma [ll| can be adapted 
easily. To show Lemma 14, a further condition is needed on := p) ^ — Pj_i- We 
assume that there exists a positive number such that for any non-negative number 
j and any / G {0, • • • , — 1} we have q^Q(k)_^_i = ■ We also define the Markov chain 

Zi := (AX^c- ~ where is the index such that there exists 

a non-negative number m with and is the index such that there 

(2) ~ (2) 

exists a non-negative number m with p1^q(2)_^_i(2) = Under this assumption, we 

can show Lemma 


8.5.4 Example (structural autoregressive couditioual duratiou model) 

We assume that the mixing variables and are interpolated by time-varying 
continuous stochastic parameters We have that := The 

central limit theorem in Example can be obtained as a straightforward corollary 
of Theorem ^ If we define for any s > 0 the grid functions gl^\s) := 
the only difference between the HBT model ([^ and the structural ACD model @ is 
that we hold the grid between two observations in the latter model. In view of this 
specific assumption which implicates that the quantities of approximation are closer 
to the approximated quantities than under the HBT model, the proof of Lemma 11 


simplifies. The proof of Lemma 14 remains unchanged as it deals only with quantities 
of approximation. 
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8.6 Jump case: proof of Remark 


We update in this section the proof in the jump case model (14). The idea is to exclude 


all the blocks where we observe a jump. Such blocks will be hnitely counted, and we 
will have at most one jump (either for or for but not for both prices at the 
same time) in each block. This is the main difference with the one-dimensional case. 

We introduce the notation 

:={*>! s.t. „ < t and there is no jumps on }. 

The proof of Lemma can be adapted because of the hniteness of jumps. The proof 
of Lemma remains unchanged. Lemma and Lemma remains true in view of the 
hniteness of jumps. Lemma and Lemma don’t need any change. We modify 


Lemma 11 as follows. Let / > 1, we have that 


sup E 


-1C 


- At 

l,J,n ^ 


and 


sup 

, 2<j<h„ 


E 


At 


ic-,+ 


i,J,n 


- At 


ic,-,+ 


i,J,n 


= Op 


— Op (q!„ ) 


The proof remains unchanged in view of the independence assumption between jumps 


and the other quantities. Lemma stays true with no further change. We introduce 
the new following lemma to be inserted between Lemmaj^and Lemma[^in the proofs. 


Lemma 16. ITe have 

n-1,1 


a. 


-2 


i&An 


^ ^ {i — l)hn-\-U^{i — l)hn-\-U+l 


u=2 


= a 


-2 




ieA 


(no) 




u=2 


+ Op(i) 


Proof. This is a simple consequence to the fact that we have at most one jump in 

'^i,n 

( 2 ') 

or AX\c _ + asymptotically, together with the hniteness of jumps. □ 
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16 


Starting from Lemma [T^ up to the end of the proof of Theorem [T| in view of Lemma 
we can use G in lieu of "z G . We have thus proved that Theorem is 


robust to jumps. 


References 

[1] Ai't-Sahalia, Y., J. Fan and D. Xiu (2010) High-frequency covariance estimates 
with noisy and asynchronous financial data, Journal of the American Statistical 
Association 105, 1504-1517. 

[2] Aldous, D.J. and G.K. Eagleson (1978) On mixing and stability of limit theorems. 
Annals of Probability 6, 325-331. 

[3] Andersen, T. G., D. Dobrev and E. Schaumburg (2012). Jump-robust volatility 
estimation using nearest neighbor truncation. Journal of Econometrics, 169(1), 
75-93. 

[4] Barndorff-Nielsen, O.E., P.R. Hansen, A. Lunde and N. Shephard (2011) Multivari¬ 
ate realised kernels: consistent positive semi-definite estimators of the covariation 
of equity prices with noise and non-synchronous trading. Journal of Econometrics 
162, 149-169. 

[5] Barndorff-Nielsen, O.E. and N. Shephard (2001) Non-gaussian Ornstein- 
Uhlenbeck-based models and some of their uses in financial economics. Journal 
of the Royal Statistical Society B 63, 167-241. 

[6] Barndorff-Nielsen, O.E. and N. Shephard (2002) Econometric analysis of realized 
volatility and its use in estimating stochastic volatility models. Journal of the Royal 
Statistical Society B 64, 253-280. 

[7] Borodin, A.N. and P. Salminen (2002) Handbook of Brownian Motion - Eacts and 
Eormulae. Probability and Its applications, Basel : Birkhauser. 

[8] Breiman, L. (1992) Probability. Glassies in Applied Mathematics 7, Society for 
Industrial and Applied Mathematics (SIAM), Philadelphia, PA. 


58 



[9] Christensen, K., S. Kinnebrock and M. Podolskij (2010) Pre-averaging estimators 
of the ex-post covariance matrix in noisy diffnsion models with non-synchronons 
data, Journal of Econometrics 159, 116-133. 

[10] Christensen, K., M. Podolskij and M. Vetter (2013) On covariation estimation for 
mnltivariate continnons ltd semimartingales with noise in non-synchronons obser¬ 
vation schemes. Journal of Multivariate Analysis, 120, 59-84. 

[11] Epps, T.W. (1979) Comovements in stock prices in the very short rnn. Journal of 
the American Statistical Association 74, 291-298. 

[12] Fnkasawa, M. (2010a) Central limit theorem for the realized volatility based on 
tick time sampling. Finance and Stochastics 14, 209-233. 

[13] Fnkasawa, M. (2010b) Realized volatility with stochastic sampling. Stochastic pro¬ 
cesses and their Applications 120, 829-552. 

[14] Fnkasawa, M. and M. Rosenbanm (2012). Central limit theorems for realized 
volatility nnder hitting times of an irregnlar grid. Stochastic processes and their 
applications 122 (12), 3901-3920. 

[15] Genon-Catalot, V. and J. Jacod (1993) On the estimation of the diffnsion coefficient 
for mnltidimensional diffnsion. Annales de I’Institut Henri Poincare Probabilites et 
Statistiques 29, 119-151 

[16] Hall, P. and C.C. Heyde (1980) Martingale Limit Theory and Its Application. 
Academic Press. 

[17] Hayashi, T., J. Jacod and N. Yoshida. Irregnlar sampling and central limit theorems 
for power variations: the continnons case. Annales de I’Institut Henri Poincare 
Probabilites et Statistiques 47, 1197-1218. 

[18] Hayashi, T. and S. Knsnoka (2008) Consistent estimation of covariation nnder 
nonsynchronicity. Statistical Inference for Stochastic Processes 11.1, 93-106. 

[19] Hayashi, T. and N. Yoshida (2005) On covariance estimation of non-synchrononsly 
observed diffnsion processes. Bernoulli 11, 359-379. 


59 



[20] Hayashi, T. and N. Yoshida (2008) Asymptotic normality of a covariance estima¬ 
tor for nonsynchrononsly observed diffnsion processes. Annals of the Institute of 
Statistical Mathematics 60, 357-396. 

[21] Hayashi, T. and N. Yoshida (2011) Nonsynchronons covariation process and limit 
theorems. Stochastic processes and their applications 121, 2416-2454. 

[22] Jacod, J. (1994) Limit of Random Measures Associated with the Increments of a 
Brownian Semi-martingale. Technical report, Universite de Paris VI. 

[23] Jacod, J. and P. Protter (1998) Asymptotic error distributions for the Euler method 
for stochastic differential equations. Annals of Probability 26, 267-307. 

[24] Jacod, J. and P. Protter (2012) Discretization of Processes. Springer. 

[25] Jacod, J. and A. Shiryaev (2003) Limit Theorems For Stochastic Processes (2nd 
ed.). Berlin: Springer-Verlag. 

[26] Koike, Y. (2014) Limit theorems for the pre-averaged Hayashi-Yoshida estimator 
with random sampling. Stochastic processes and their applications 124 (8), 2699- 
2753. 

[27] Koike, Y. (2015) Time endogeneity and an optimal weight function 
in pre-averaging covariance estimation. Preprint, Available at arXiv: 
http: / / arxiv.org/abs/1403.7889v2, 

[28] Li, Y., P.A. Mykland, E. Renault, L. Zhang and X. Zheng (2014) Realized volatility 
when sampling times are possibly endogenous. Econometric Theory 30, 580-605. 

[29] Meyn, S.P. and R.L. Tweedie (2009) Markov Chains and Stochastic Stability. Cam¬ 
bridge University Press. 

[30] Mykland, P.A. (2012). A Gaussian calculus for inference from high frequency data. 
Annals of finance, 8(2-3), 235-258. 

[31] Mykland, P.A., and L. Zhang (2006) ANOVA for Diffusions and ltd Processes. 
Annals of Statistics 34, 1931-1963. 


60 



[32] Mykland, P.A. and L. Zhang (2009) Inference for Continuous Seminiartingales 
Observed at High Frequency. Econometrica 77, 1403-1445. 

[33] Mykland, P.A. and L. Zhang (2012) The econometrics of High Frequency Data. In 
M. Kessler, A. Lindner and M. Sqrensen (eds.). Statistical Methods for Stochastic 
Differential Equations, pp. 109-190. Chapman nad Hall/CRC Press. 

[34] Potiron, Y. (2016). Estimating the integrated parameter of the locally parametric 
model in high-frequency data. arXiv preprint arXiv:1603.05700. 

[35] Potzelberger, K. and L. Wang (2001) Boundary crossing probability for Brownian 
motion. Journal of Applied Probability 38, 152-164. 

[36] Renault, E., T. van der Heijden and B.J. Werker (2009) A structural autoregres¬ 
sive conditional duration model. Presented at the 2010 Winter Meetings of the 
Econometric Society in Atlanta. 

[37] Renault, E., T. van der Heijden and B.J. Werker (2014). The dynamic mixed 
hitting-time model for multiple transaction prices and times. Journal of Econo¬ 
metrics 180, 233-250. 

[38] Renyi, A. (1963) On stable sequences of events. Sanky Series A 25, 293-302. 

[39] Revuz, D. and M. Yor (1999) Continuous Martingales and Brownian motion. 3rd 
ed., Germany: Springer. 

[40] Rootzen, H. (1980) Limit distributions for the error in approximations of stochastic 
integrals. Annals of Probability 8, 241-251. 

[41] Robert, C.Y. and M. Rosenbaum (2011) A new approach for the dynamics of 
ultra-high-frequency data: the model with uncertainty zones. Journal of Pinancial 
Econometrics 9, 344-366. 

[42] Robert, C.Y. and M. Rosenbaum (2012) Volatility and covariation estimation when 
microstructure noise and trading times are endogenous. Mathematical Einance 22 
(1), 133-164. 


61 



[43] Zhang, L. (2001) From Martingales to ANOVA ; Implied and Realized Volatility. 
Ph.D. Thesis, The University of Chicago, Department of Statistics. 

[44] Zhang, L. (2011) Estimation covariation; Epps effect, microstrnctnre noise. Journal 
of Econometrics 160, 33-47. 


62 



Illustration of the HBT model 



Figure 1: This is an illustration of the HBT model when starting at time tq = 0 and 
with Xq = 100. The black stochastic process represents Xt, the red line stands for 
100 + Ut{t) and the blue line for 100 + dt{t). Furthermore, we assume that = Xt. 
The second observation ri is obtained when Xt crosses the red line for the hrst time. 
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Figure 2: Histogram and Normal QQ-plot of the standardized estimates (50) in setting 
1 on a 10-year period of observations. 
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No. years 

estim 

setting 

sample bias 

RMSE 

% Reduced RMSE 

1 

HY 

1 

5.41e-07 

1.36e-05 

- 

1 

BCHY 

1 

5.43e - 07 

1.19e-05 

13% 

5 

HY 

1 

l.lOe-07 

1.42e - 05 

- 

5 

BCHY 

1 

1.07e-07 

1.26e-05 

11% 

10 

HY 

1 

5.54e - 08 

1.39e-05 

- 

10 

BCHY 

1 

5.53e-08 

1.20e-05 

14% 

1 

HY 

2 

5.47e - 07 

1.66e-05 

- 

1 

BCHY 

2 

5.44e - 07 

1.50e-05 

9% 

5 

HY 

2 

1.13e-07 

1.71e-05 

- 

5 

BCHY 

2 

1.15e-07 

1.58e-05 

8% 

10 

HY 

2 

5.58e-08 

1.70e-05 

- 

10 

BCHY 

2 

5.60e-08 

1.57e-05 

8% 

1 

HY 

3 

5.61e-07 

1.80e-05 

- 

1 

BCHY 

3 

5.62-07 

1.67e-05 

7% 

5 

HY 

3 

1.14e-07 

1.81e-05 

- 

5 

BCHY 

3 

1.12e-07 

1.68e-05 

7% 

10 

HY 

3 

5.56e-08 

1.80e-05 

- 

10 

BCHY 

3 

5.55e-08 

1.68e-05 

7% 

1 

HY 

4 

4.41e-07 

l.lOe-05 

- 

1 

BCHY 

4 

4.44e - 07 

l.lle-05 

-1% 

5 

HY 

4 

8.81e-08 

l.lOe-05 

- 

5 

BCHY 

4 

8.80e-08 

1.09e-05 

1% 

10 

HY 

4 

4.39e - 08 

1.08e-05 

- 

10 

BCHY 

4 

4.43e - 08 

1.08e-05 

0% 


Table 1: Summary statistics based on simulated endogenous data of 1, 5 and 10 years. 
The RMSE in the table corresponds to the square root of the squared distance between 
the estimated value and the true value 6.4e — 05. HY stands for the usual Hayashi- 
Yoshida estimator (|^, and BCHY represents the bias-corrected estimator (15). 
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Table 2: 
statistic 
Ar(0,l). 


No. years 

0.5 % 

2.5 % 

5 % 

95 % 

97.5 % 

99.5 % 

1 

-2.48 

-1.99 

-1.59 

1.66 

2.13 

2.57 

5 

- 2.60 

-1.96 

-1.64 

1.64 

2.05 

2.62 

10 

- 2.68 

- 1.98 

-1.60 

1.65 

2.01 

2.73 


In this table, we report the finite sample quartiles of the feasible standardized 


(50) in 


setting 1. 


The benchmark quartiles are those for the limit distribution 
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